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'6/12020 PCTAJS95/13623 
HEMOGLOBIN RECEPTORS FROM NEISSERIAS 

This invention was made with government support under National Institute 
of Health grants ROl AI32493 and ROl AI22933. The U.S. government has certain 
rights to this invention. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

This invention relates to hemoglobin receptor genes and the proteins encoded 
therefrom of certain bacterial species, particularly species of Neisseria bacteria. 
More particularly, this invention relates to hemoglobm receptor genes, polypeptides 
and peptides useful for preparing vaccines and antibodies against Neisseria, and 
methods and means for producing such peptides and polypeptides in vitro. Also 
provided arc diagnostic and therapeutic methods and reagents useful in detecting and 
treating Neisseria infection and methods for developing novel and effective anti- 
Neisseria agents. 

2. Background of the Invention 

The Neisseriae comprise a genus of bacteria that includes two gram-negative 
species of pyogenic cocci pathogenic for humans: Neisseria meningitidis and 
Neisseria gonorrhoeae. N. meningitidis is a major cause of bacterial meningitis in 
humans, especially children. The disease characteristically proceeds from 
asymptomatic carriage of the bacterium in the nasopharynx to invasion of the 
bloodstream and cerebrospinal fluid in susceptible individuals. 

Neisseria meningitidis is one of the leading causes of bacterial meningitis in 
children and healthy adults in the world. The severity of the disease is evidenced 
by the ability of meningococci to cause the death of previously healthy individuals 
in less than 24 hours. N. meningitidis has a polysaccharide capsule whose diversity 
of component antigenic polysaccharide molecules has resulted in the classification of 
ten different serogroups. Of these, group A strains are the classic epidemic strains; 
group B and C are generally endemic strains, but C occasionally causes an epidemic 
outbreak. All known group A strains have the same protein antigens on their outer 
membranes, while group B strains have a dozen serotypes or groupings based on the 
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presence of principal outer membrane protein antigens (as opposed to 
polysaccharides). 

Survival of a pathogen such as A^. meningitidis in a host depends on its ability 
to overcome a battery of host defense mechanisms. One nonspecific host defense 
5 mechanism against microbial intruders is to limit the availability of iron in tissues 
(Weinberg, 1984, Physiological. Rev. 64l 65-102), because iron is a necessary 
nutrient for most microbial pathogens. The vast majority of iron in the human adult 
is located intraceliularly in the form of hemoglobin (76%) or ferritin (23%). The 
remainder can be found extracellularly bound to host iron-binding proteins such as 
10 transferrin and lactoferrin (Otto et al, 1992, Crit. Rev. Microbiol. 18: 217-233). 

Pathogenic bacteria have adapted to this iron-limiting environment by 
developing highly specific and effective iron assimilation systems. A large number 
of these bacteria secrete siderophores, small, non-protein iron chelators which, due 
to their extremely high affinity for iron (III), scavenge trace amounts of iron(III) 
15 from the environment and shuttle the iron back to the bacterial cell (Baggs and 
Neilands, 1987, Microbiol. Rev. 51: 509-518; Braun and Hantke, 1991, in 
Winkehnann (ed.). Handbook of Microbial Iron Chelates, CRC Press: Boca Raton, 
Fla., pp. 107-138.). 

Alternatively, some bacterial pathogens, like Neisseriae species (Archilbald 
20 and DeVoe, 1979, FEMS Microbiol. Lett. 6: 159-162; Mickelson et al. , 1982, Infect. 
Immun. 35: 915-920; Dyer et al., 1987, Infect. Immun. 55: 2171-2175), 
Haemophilus influenzae (Coulton and Pang, 1983, Curr. Microbiol. 9: 93-98; 
Schryvers, 1988, Mol. Microbiol. 2: 467-472; Jarosik etal., 1994, Infect. Immun. 
62: 2470-2477), Vibrio cholerae (Stoebner and Payne, 1988, Infect. Immun. 56: 
25 2891-2895; Henderson and Payne, 1994, J. Bacterial. 176: 3269-3277), Yersiniae 
(Stojiljkovic and Hantke, 1992, EMBO J. U: 4359-4367) and Actinobacillus 
pleuropneumoniae (Gerlach et al., 1992, Infect. Immun. 60: 3253-3261) have 
evolved more sophisticated mechanisms to sequester iron from the host. These 
pathogens can directly bind host's iron-binding proteins such as lactoferrin, 
30 transferrin, and heme-containing compounds, and use them as sole sources of iron. 

The importance of iron in the virulence of A^. meningitidis was demonstrated 
by in vivo studies using mice as the animal model system (Calver et al. , 1976, Can. 
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J. Microbiol. 22: 832-838; Holbien et al, 1981, Infect. Immun. 34: 120-125). 
Specific iron-regulated outer membrane receptors have been shown to be involved 
in the binding and the utilization of lactoferrin- and transferrin-iron in Neisseriae 
(Schryvers and Morris, 1988, Infect. Immun. 56: 1144-1149 and Mol. Microbiol. 2: 
281-288; Legrain et al, 1993, Gene 130: 81-90; Pettersson et al., 1993, Infect. 
Immun. 61: 4724-4733 and 1994. J. Bacteriol. 176: 1764-1766). These receptors 
share significant amino acid similarity and, most probably, also the mechanism of 
iron internalization, with receptors for siderophores and vitamin B12 of other Gram- 
negative bacteria (Comelissen et al., 1993, J. Bacteriol. 174: 5788-5797). In 
contrast, the mechanism by which Neisseriae utilize hemoglobin- and hemin-iron as 
well as the components involved have so far not been described. 

Recently, several proteins with hemoglobin-binding and/or hemin-binding 
activities have been identified in total membranes of iron-limited A^. meningitidis and 
A^. gonorrhoeae. 

Lee and Hill, 1992, J. gen. Microbiol. 138: 2647-2656 disclose the specific 
hemoglobin binding by isolated outer membranes of A^. meningitidis. 

Martek and Lee. 1994, Infect. Immun. 62: 700-703 disclosed that acquisition 
of heme iron by N. meningitidis does not involve meningococcal transferrin-bmding 
proteins. 

Lee, 1994, Microbiol. 140: 1473-1480 describes the biochemical isolation and 
characterization of hemin binding proteins from N. meningitidis. 

The precise role of these proteins in hemin and/or hemoglobin utilization 
remains unclear at present, although these proteins are likely to be components of 
a hemin-utilization system in A^. meningitidis. 

The dependence on host iron stores for Neisseria growth is a potentially 
useftU route towards the development of novel and effective therapeutic intervention 
strategies. Historically, infections of both A^. meningitidis and A^. gonorrhoeae were 
treated chemoprophylactically with sulfonamide drugs. However, widi the 
development of sulfonamide-resistant strains came the necessity of using alternative 
modes of therapy such as antibiotic treatment. More recently, the drug treatment of 
choice includes the administration of high grade penicillin. However, the success 
of antimicrobial treatment is decreased if therapy is not initiated early after infection. 
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Gonococcal infection has also been treated with penicillin, ampicillin, or 
amoxicillin, tetracycline hydrochloride, and spectinomycin. Unfortunately, because 
the incidence of infections due to penicillinase-producing bacteria has increased, 
several new, more expensive B-lactam antibiotics have been used in treatment. 
5 Despite the fact that existing antibiotics have decreased the serious consequences of 
gonorrhea, their use has not lowered the incidence of the infection in the general 
population. 

Prevention of meningococcal disease has been attempted by chemoprophylaxis 
and immunoprophylaxis. At present, rifampin and minocycline are used, but only 
10 for humans in close contact with an infected person as this treatment has a number 
of disadvantages. The only commercially available vaccine against meningococcal 
meningitis has as its major component the bacterial polysaccharide capsule. In adults 
this vaccine protects against serogroups A, C, Y and W135. It is not effective 
against serogroup B, and is ineffective in children against serogroup C. Thus far, 
15 immunoprophylatic preventive treatment has not been available for A^. gonorrhoeae. 

Thus, what is needed are better preventative therapies for meningococcal 
meningitis and gonorrhea including more effective, longer lasting vaccines which 
protect across all of the serogroups of A^. meningitidis and all the serotypes of A^. 
gonorrhoeae. In addition, better methods are need to treat meningococcal and 
20 gonococcal infection. 



SUMMARY OF THE INVENTION 

The present invention relates to the cloning, expression and functional 
characterization of genes encoding bacterial hemoglobin receptor proteins. 

25 Specifically, the invention relates to genes encoding hemoglobin receptor proteins 
from Neisseria species, in paiticular Neisseria meningitidis and N. gonorrhoeae. The 
invention comprises species of nucleic acids having a nucleotide sequence encoding 
novel bacterial hemoglobin receptor proteins. Also provided by this invention is the 
deduced amino acid sequence of the cognate hemoglobin receptor proteins of these 

30 bacterial genes. 

The invention provides nucleic acids, nucleic acid hybridization probes, 
recombinant expression consttnicts capable of expressing the hemoglobin receptor 



-4- 



wo 96/12020 



PCT/US95/13623 



protein of the invention in cultures of transformed cells, preferably bacterial cells, 
and such cultures of transformed bacterial cells that express the hemoglobin receptor 
proteins of the invention. The invention also provides gene knockout vectors for 
inactivating the hemoglobin receptor protein gene in cells, particularly cells of 
5 Neisseria species, via, for example, homologous recombination and other 
mechanisms, and cultures of such hemoglobin receptor protein null mutant cells. 

The invention also provides homogeneous preparations of the bacterial 
hemoglobin receptor proteins of the invention, as well as antibodies against and 
epitopes of the hemoglobin receptor protein. Methods for characterizing this 
10 receptor protein and methods for using the protein in the development of agents 
having pharmacological uses related to this receptor, particularly bactericidal and 
bacteriostatic uses, are also provided by the invention. 

In other embodiments of this invention are provided diagnostic methods and 
reagents encompassing the use of the anti-Neisseria hemoglobin receptor protein 
15 antibodies of the invention. Still further embodiments provided herein include 
therapeutic methods and reagents encompassing the use of the anti-Neisseria 
hemoglobin receptor protein antibodies of the invention. Even more embodiments 
include diagnostic methods and reagents encompassing the use of the Neisseria 
hemoglobin receptor protein-encoding nucleic acids of the invention, as sensitive 
20 probes for the presence of Neisseria infection using nucleic acid hybridization 
techniques and/or in vitro amplification methodologies. Yet additional embodiments 
of the invention include therapeutic methods and reagents encompassing the use of 
the Neisseria hemoglobin receptor protein-encoding nucleic acids of the invention, 
comprising recombinant expression constructs engineered to produce antisense 
25 transcripts of the Neisseria hemoglobin receptor gene and fragments thereof, as well 
as recombinant knockout vectors of the invention. The invention also provides the 
Neisseria hemoglobin receptor protein and epitopes thereof as components of 
vaccines for the development of non-disease associated immunity to pathological 
infection with bacteria of Neisseria species. 
30 In a first aspect, the invention provides a nucleic acid having a nucleotide 

sequence encoding a bacterial hemoglobin receptor protein gene. In a preferred 
embodiment, the bacterial hemoglobin receptor protein gene is isolated from bacteria 



wo 96/12020 



PCT/US95/13623 



of Neisseria species. In a particularly preferred embodiment, the hemoglobin 
receptor protein gene is isolated from Neisseria meningitidis, serotype C. In a 
particular example of this embodiment, the nucleic acid comprises a 3.3 kilobase (kb) 
BammiHindlU fragment of A^. meningitidis genomic DNA. In this embodiment, the 
5 nucleotide sequence comprises an open reading frame of 2376 nucleotides of A^. 
meningitidis genomic DNA encoding 792 amino acids comprising the hemoglobin 
receptor gene. In this embodiment of the invention, the nucleotide sequence of the 
N. meningitidis hemoglobin receptor gene is the sequence depicted in Figure 2 (SEQ 
ID No:l). It will be understood that the A^. meningitidis gene as disclosed herein is 
10 defined, insofar as is necessary, by the amino acid sequence of the protein encoded 
therein, said amino acid sequence being represented in Figure 2 (SEQ. ID No.:2). 
Thus, it will be understood that the particular nucleotide sequence depicted in Figure 
2 (SEQ. ID. No.:l) is but one of a number of equivalent nucleotide sequences that 
encode the hemoglobin receptor protein, due to the degeneracy of the genetic code, 

15 and that all such alternative, equivalent nucleotide sequences are hereby explicitly 
encompassed within the disclosed nucleotide sequences of the invention. Also 
included herein are any mutant or allelic variations of this nucleotide sequence, either 
naturally occurring or the product of in vitro chemical or genetic modification. Each 
such variant will be tmderstood to have essentially the same nucleotide sequence as 

20 the nucleotide sequence of the corresponding A', meningitidis hemoglobin receptor 
protein disclosed herein. 

In another particularly preferred embodiment of this aspect of the invention, 
the hemoglobin receptor protein gene is isolated fi-om Neisseria meningitidis, 
serotype A. In a particular example of this embodiment, the nucleic acid comprises 

25 a 2373 basepair (bp) polymerase chain reaction-amplified fragment of A^. 
meningitidis, serotype A genomic DNA. In this embodiment, the nucleotide 
sequence comprises an open reading fi-ame of 2373 nucleotides of N. meningitidis 
genomic DNA encoding 790 amino acids comprising the hemoglobin receptor gene. 
In this embodiment of the invention, the nucleotide sequence of the N. meningitidis 

30 hemoglobin receptor gene is the sequence depicted in Figure 7 (SEQ ID No:3). It 
will be understood that the N. meningitidis gene as disclosed herein is defined, 
insofar as is necessary, by the amino acid sequence of the protein encoded therein, 
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said amino acid sequence being represented in Figure 7 (SEQ. ID No.: 4). Thus, it 
will be understood that the particular nucleotide sequence depicted in Figure 7 (SEQ. 
ID. No.: 3) is but one of a number of equivalent nucleotide sequences that encode the 
hemoglobin receptor protein, due to the degeneracy of the genetic code, and that all 
5 such alternative, equivalent nucleotide sequences are hereby explicitly encompassed 
within the disclosed nucleotide sequences of the invention. Also included herein are 
any mutant or allelic variations of this nucleotide sequence, either naturally occurring 
or the product of in vitro chemical or genetic modification. Each such variant will 
be understood to have essentially the same nucleotide sequence as the nucleotide 
10 sequence of the corresponding N. meningitidis hemoglobin receptor protein disclosed 
herein. 

In another particularly preferred embodiment of this aspect of the invention, 
the hemoglobin receptor protein gene is isolated from Neisseria meningitidis, 
serotype B. In a particular example of this embodiment, the nucleic acid comprises 

15 a 2376 basepair (bp) polymerase chain reaction-amplified fragment of N. 
meningitidis, serotype A genomic DNA. In this embodiment, the nucleotide 
sequence comprises an open reading frame of 2373 nucleotides of N. meningitidis 
genomic DNA encoding 791 amino acids comprising the hemoglobin receptor gene. 
In this embodiment of the invention, the nucleotide sequence of the N. meningitidis 

20 hemoglobin receptor gene is the sequence depicted in Figure 8 (SEQ ID No: 5). It 
will be understood that the N. meningitidis gene as disclosed herein is defined, 
insofar as is necessary, by the amino acid sequence of the protein encoded therein, 
said amino acid sequence being represented m Figure 8 (SEQ. ID No. : 6). Thus, it 
will be understood that the particular nucleotide sequence depicted in Figure 8 (SEQ. 

25 ID. No. : 5) is but one of a number of equivalent nucleotide sequences that encode the 
hemoglobin receptor protein, due to the degeneracy of the genetic code, and that all 
such alternative, equivalent nucleotide sequences are hereby explicitly encompassed 
within the disclosed nucleotide sequences of the invention. Also included herein are 
any mutant or allelic variations of this nucleotide sequence, either naturally occurring 

30 or the product of in vitro chemical or genetic modification. Each such variant will 
be understood to have essentially the same nucleotide sequence as the nucleotide 
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sequence of the corresponding A^. meningitidis hemoglobin receptor protein disclosed 
herein. 

In yet other preferred embodiments, the invention provides nucleic acid 
encoding a hemoglobin receptor protein gene isolated from Neisseria gonorrhoeae. 
In a particular example of this embodiment, the nucleic acid comprises a 2378 
basepair (bp) polymerase chain reaction-amplified fragment of A^. gonorrhoeae 
genomic DNA. In this embodiment, the nucleotide sequence comprises an open 
reading frame of 2373 nucleotides of N. gonorrhoeae genomic DNA encoding 791 
amino acids comprising the hemoglobin receptor gene. In fliis embodiment of the 
invention, the nucleotide sequence of the N. gonorrhoeae hemoglobin receptor gene 
is the sequence depicted in Figure 9 (SEQ ID No:7). It will be understood that the 
A^. gonorrhoeae gene as disclosed herein is defined, insofar as is necessary, by the 
amino acid sequence of the protein encoded therein, said amino acid sequence being 
represented in Figure 9 (SEQ. ID No.: 8). Thus, it will be understood that the 
particular nucleotide sequence depicted in Figure 9 (SEQ. ID. No.:7) is but one of 
a number of equivalent nucleotide sequences that encode the hemoglobin receptor 
protein, due to the degeneracy of the genetic code, and that all such alternative, 
equivalent nucleotide sequences are hereby explicitly encompassed within the 
disclosed nucleotide sequences of the invention. Also included herein are any mutant 
or allelic variations of this nucleotide sequence, either namrally occurring or the 
product of in vitro chemical or genetic modification. Each such variant will be 
understood to have essentially the same nucleotide sequence as the nucleotide 
sequence of the corresponding N. gonorrhoeae hemoglobin receptor protein disclosed 
herein. 

The invention also provides bacterial hemoglobin receptor proteins. In a 
preferred embodunent, the bacterial hemoglobm receptor protein is isolated from 
bacteria of Neisseria species. In a particularly preferred embodiment, the 
hemoglobin receptor protein is isolated from Neisseria meningitidis. In a particular 
example of this embodiment, the protein is derived fi-om A', meningitidis, serotype 
C and comprises an amino acid sequence of 792 amino acids. In this embodiment 
of the invention, the amino acid sequence of the A^. meningitidis, serotype C 
hemoglobin receptor protein is the sequence depicted in Figure 2 (SEQ ID No:2). 
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In another example of this embodiment, the protein is derived from N. meningitidis, 
serotype A and comprises an amino acid sequence of 790 amino acids. In this 
embodiment of die invention, the amino acid sequence of the N. meningitidis, 
serotype A hemoglobin receptor protein is the sequence depicted in Figure 7 (SEQ 
5 ID No:4). In yet another example of this embodiment, the protein is derived from 
N. meningitidis, serotype B and comprises an amino acid sequence of 791 amino 
acids. In this embodiment of the invention, the amino acid sequence of the A^. 
meningitidis, serotype B hemoglobin receptor protein is the sequence depicted in 
Figure 8 (SEQ ID No:6). The invention also provides hemoglobin receptor protein 

10 derived from N. gonorrhoeae. In this embodiment of the invention, the protein 
comprises an amino acid sequence of 791 amino acids, and the amino acid sequence 
of the N. gonorrhoeae hemoglobin receptor protein is the sequence depicted in 
Figure 9 (SEQ ID No: 8). Also explicitly encompassed within the scope of this 
invention are related bacterial hemoglobin receptor proteins, particularly such 

15 proteins isolated from Neisseria species, having essentially the same amino acid 
sequence and substantially the same biological properties as the hemoglobin receptor 
protein encoded by the N. meningitidis and N. gonorrhoeae nucleotide sequences 
described herein. 

In another aspect, the invention provides a homogeneous preparation of an 
20 approximately 85.5 kiloDalton (kD) bacterial hemoglobin receptor protein or 
derivative thereof, said size bemg understood to be the size of the protein before any 
post-translational modifications thereof. Also provided is a 90kD embodiment of the 
receptor as determined by sodium dodecyl sulfate/ polyacrylamide gel electrophoresis 
under reducii^ conditions. In a preferred embodiment, the bacterial hemoglobin 
25 receptor protein is isolated from bacteria of Neisseria species. In a particularly 
preferred embodiment, the hemoglobin receptor protein is isolated from Neisseria 
meningitidis. In one embodiment of this aspect of the invention, the protein is 
isolated from N. meningitidis, serotype C and the amino acid sequence of the 
bacterial hemoglobin receptor protein or derivative thereof preferably is the amino 
30 acid sequence of the hemoglobin receptor protein shown in Figure 2 (SEQ ID No:2). 
In a second embodiment of diis aspect of the invention, the protein is isolated from 
N. meningitidis, serotype A and the amino acid sequence of the bacterial hemoglobin 
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receptor protein or derivative thereof preferably is the amino acid sequence of the 
hemoglobin receptor protein shown in Figure 7 (SEQ ID No:4). In a third 
embodunent of this aspect of the invention, the protein is isolated from A^. 
meningitidis, serotype B and the amino acid sequence of the bacterial hemoglobin 
receptor protein or derivative thereof preferably is the amino acid sequence of the 
hemoglobin receptor protein shown in Figure 8 (SEQ ID No:6). The invention also 
provides a homogeneous preparation of a bacterial hemoglobin receptor protein 
isolated from A^. gonorrhoeae. In a preferred embodiment, the amino acid sequence 
of the bacterial hemoglobin receptor protein or derivative thereof preferably is the 
amino acid sequence of the hemoglobin receptor protein shown in Figure 9 (SEQ ID 
No:8). 

This invention provides nucleotide probes derived from the nucleotide 
sequences herein provided. The invention includes probes isolated from either 
complementary DNA (cDNA) copies of bacterial messenger RNA (mRNA) or 
bacterial genomic DNA (gDNA), as well as probes made synthetically or by in vitro 
amplification methods using the sequence information provided herein. The 
invention specifically includes but is not limited to oligonucleotide, nick-translated, 
random primed, or in vitro amplified probes made using cDNA or genomic clones 
embodying the invention, and oligonucleotide and other synthetic probes synthesized 
chemically using the nucleotide sequence information of cDNA or genomic clone 
embodiments of the invention. 

It is a further object of this invention to provide such nucleic acid 
hybridization probes to detect the presence of bacteria of Neisseria species, 
particularly N. meningitidis and N. gonorrhoeae, in a biological sample in the 
diagnosis of a Neisseria infection in a human. Such a biological sample preferably 
includes blood, urine, semen, mucus, cerebrospinal fluid, peritoneal fluid and ascites 
fluids, as well as cell scrapings from the epithelium of the mouth, urethra, anus and 
rectum, and other organs. 

The present invention also includes peptides encoded by the nucleotide 
sequences comprising the nucleic acid embodiments of the invention. The invention 
includes either naturally occurring or synthetic peptides which may be used as 
antigens for the production of hemoglobin receptor protein-specific antibodies. The 
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invention also comprises such antibodies, preferably monoclonal antibodies, and cells 
and cultures of cells producing such antibodies. 

Thus, the invention also provides antibodies against and epitopes of bacterial 
hemoglobin receptor proteins of the invention. It is an object of the present 
5 invention to provide antibodies that are immunologically reactive to the bacterial 
hemoglobin receptor proteins of the invention. It is a particular object to provide 
monoclonal antibodies against these bacterial hemoglobin receptor proteins. In a 
preferred embodiment, antibodies provided are raised against bacterial hemoglobin 
receptor protein isolated from bacteria of Neisseria species. In a particularly 
10 preferred embodiment, such antibodies are specific for the hemoglobin receptor 
protein isolated from Neisseria meningitidis serotypes A, B or C. In additional 
particularly preferred embodiment, such antibodies are specific for the hemoglobin 
receptor protein isolated from Neisseria gonorrhoeae. 

Hybridoma cell lines producing such antibodies are also objects of the 
15 invention. It is envisioned at such hybridoma cell lines may be produced as the 
result of fusion between a non-immunoglobulin producing mouse myeloma cell line 
and spleen cells derived from a mouse immunized with purified hemoglobin receptor 
protein or a cell expressing antigens or epitopes of bacterial hemoglobin receptor 
proteins of the invention. The present invention also provides hybridoma cell lines 
20 that produce such antibodies, and can be injected into a living mouse to provide an 
ascites fluid from the mouse that is comprised of such antibodies. In a preferred 
embodiment, antibodies provided are raised against bacterial hemoglobin receptor 
protein isolated from bacteria of Neisseria species. In a particularly preferred 
embodiment, such antibodies are specific for the hemoglobin receptor protein isolated 
25 from Neisseria meningitidis, serotypes A, B or C. In additional particularly 
preferred embodiment, such antibodies are specific for the hemoglobin receptor 
protein isolated from Neisseria gonorrhoeae. 

It is a further object of the invention to provide immunologically-active 
epitopes of the bacterial hemoglobin receptor proteins of the invention. Chimeric 
30 antibodies immunologically reactive against the bacterial hemoglobin receptor 
proteins of the invention are also within the scope of this mvention. In a preferred 
embodiment, antibodies and epitopes provided are raised against or derived from 



- 11 - 



96/12020 



PCT/US95/13623 



bacterial hemoglobin receptor protein isolated from bacteria of Neisseria species. 
In a particulariy preferred embodiment, such antibodies and epitopes are specific for 
the hemoglobin receptor protein isolated from Neisseria meningitidis, serotypes A, 
B or C. In additional particularly preferred embodiment, such antibodies and 
epitopes are specific for the hemoglobin receptor protein isolated from Neisseria 
gonorrhoeae. 

The present invention provides recombinant expression constructs comprising 
a nucleic acid encoding a bacterial hemoglobin receptor protein wherein the construct 
is capable of expressing the encoded hemoglobin receptor protein in cultures of cells 
transformed with the construct. Preferred embodiments of such constructs comprise 
the N. meningitidis, serotype C hemoglobin receptor gene depicted in Figure 2 (SEQ 
ID No.:l), such constructs being capable of expressmg the bacterial hemoglobin 
receptor protein encoded therein in cells transformed with the construct. Additional 
preferred embodiments of such constructs comprise the A^. meningitidis, serotype A 
hemoglobin receptor gene depicted in Figure 7 (SEQ ID No.:3). such constructs 
being capable of expressing the bacterial hemoglobm receptor protein encoded 
therein in cells transformed with the construct. Further additional preferred 
embodiments of such constructs comprise the N. meningitidis, serotype B hemoglobin 
receptor gene depicted in Figure 8 (SEQ ID No.:5), such constructs being capable 
of expressing the bacterial hemoglobin receptor protein encoded therein in cells 
transformed with the construct. The invention also provides recombinant expression 
constructs encoding a hemoglobin receptor protein gene isolted from ZN. 
gonorrhoeae. In a particularly preferred embodiment, such constructs comprise the 
N. gonorrhoeae hemoglobin receptor gene depicted in Figure 9 (SEQ ID No. ;7), the 
constructs being capable of expressing the bacterial hemoglobin receptor protein 
encoded therein in cells transformed with the construct. 

The invention also provides cultures of cells, preferably bacterial cells, having 
been transformed with the recombinant expression constructs of the invention, each 
such cultures being capable of and in fact expressing the bacterial hemoglobin 
receptor protein encoded in the transforming construct. 

The present invention also includes within its scope protein preparations of 
prokaryotic cell membranes containing the bacterial hemoglobin receptor protein of 
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the invention, derived from cultures of prokaryotic cells transformed with the 
recombinant expression constructs of the invention. 

The invention also provides diagnostic reagents and methods for using such 
reagents for detecting the existence of an infection in a human, with bacteria of a 
5 Neisseria species. In preferred embodiments, such diagnostic reagents comprise 
antibodies that are immunologically reactive with a bacterial hemoglobin receptor 
protein. In a preferred embodiment, such antibodies are raised against a bacterial 
hemoglobin receptor protein isolated from bacteria of Neisseria species. In a 
particularly preferred embodiment, such antibodies are specific for the hemoglobin 
10 receptor protein isolated from Neisseria meningitidis, serotypes A, B or C. In 
additional particularly preferred embodiments, such antibodies are specific for the 
hemoglobin receptor protein isolated from Neisseria gonorrhoeae. 

In yet another embodiment of this aspect of the invention are provided 
diagnostic reagents and methods for using such reagents wherein said reagents are 
15 nucleic acid hybridization probes comprising a bacterial hemoglobin receptor gene. 
In a preferred embodiment, the bacterial hemoglobin receptor protein gene is isolated 
from bacteria of Neisseria species. In a particularly preferred embodiment, the 
hemoglobin receptor protein gene is isolated from Neisseria meningitidis. In 
particular examples of this embodiment of the invention, the nucleic acid probes 
20 comprise a specifically-hybridizing fragment of a 3.3 kilobase (kb) BamHI/H/ndlll 
fragment of N. meningitidis, serotype C genomic DNA. In this embodiment, the 
nucleotide sequence comprises all or a specifically-hybridizing fragment of an open 
reading ft^e of 2376 nucleotides of N. meningitidis, serotype C genomic DNA 
encoding 792 amino acids comprising the hemoglobin receptor gene. In this 
25 embodiment of the invention, the nucleotide sequence of the N. meningitidis, 
serotype C hemoglobin receptor gene is the sequence depicted in Figure 2 (SEQ ID 
No:l). In another example of this embodiment of the invention, the nucleic acid 
probes comprise a specifically-hybridizing fragment of a 2373bp, polymerase chain 
reaction-amplified fragment of N. meningitidis, serotype A genomic DNA. In this 
30 embodiment, the nucleotide sequence comprises all or a specifically-hybridizing 
fragment of an open reading frame of 2370 nucleotides of N. meningitidis, serotype 
A genomic DNA encoding 790 amino acids comprising the hemoglobin receptor 
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gene. In this embodiment of the invention, the nucleotide sequence of the A^. 
meningitidis, serotype A hemoglobin receptor gene is the sequence depicted in Figure 
7 (SEQ ID No:3). In yet another example of this embodiment of the invention, the 
nucleic acid probes comprise a specifically-hybridizing fragment of a 2376bp, 
polymerase chain reaction-amplified fragment of A^. meningitidis, serotype B genomic 
DNA. In this embodiment, the nucleotide sequence comprises all or a specifically- 
hybridizing fragment of an open reading frame of 2373 nucleotides of A^. 
meningitidis, serotype B genomic DNA encoding 791 amino acids comprising the 
hemoglobin receptor gene. In this embodiment of the invention, the nucleotide 
sequence of the N. meningitidis, serotype B hemoglobin receptor gene is the 
sequence depicted in Figure 8 (SEQ ID No:5). The invention also provides nucleic 
acid hybridization probes comprismg a bacterial hemoglobin receptor gene isolated 
from A^. gonorrhoeae. In a preferred embodiment of this aspect of the invention, the 
nucleic acid probes comprise a specifically-hybridizing fragment of a 2378bp, 
polymerase chain reaction-amplified fi-agment of A^. gonorrhoeae genomic DNA. In 
this embodiment, the nucleotide sequence comprises all or a specifically-hybridizing 
fragment of an open reading frame of 2373 nucleotides of A^. gonorrhoeae genomic 
DNA encoding 791 amino acids comprising the hemoglobm receptor gene. In this 
embodiment of the invention, the nucleotide sequence of the N. gonorrhoeae 
hemoglobin receptor gene is the sequence depicted in Figure 9 (SEQ ID No:7). It 
will be understood that the term "specifically-hybridizing" when used to describe a 
fragment of a nucleic acid encoding a bacterial hemoglobin receptor gene is intended 
to mean that nucleic acid hybridization of such a fragment is stable under high 
stringency conditions of hybridization and washing as the term "high stringency" 
would be understood by those having skill in the molecular biological arts. 

Also provided by the invention are therapeutic agents and methods for using 
such agents for treating the an infection in a human, with bacteria of a Neisseria 
species. In preferred embodiments, such agents comprise antibodies that are 
immunologically reactive with a bacterial hemoglobin receptor protein. In a 
preferred embodiment, such antibodies are raised against a bacterial hemoglobin 
receptor protein isolated from bacteria of Neisseria species. In a particularly 
preferred embodiment, such antibodies are specific for the hemoglobin receptor 
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protein isolated from Neisseria meningitidis, serotypes A, B or C. In additional 
preferred embodiments, such antibodies are specific for the hemoglobin receptor 
protein isolated from Neisseria gonorrhoeae. Therapeutic agents provided in this 
aspect of the invention comprise such antibodies in a pharmaceutically-acceptable 
5 carrier, along with appropriate adjuvants and the like. In additional embodiments, 
such antibodies are covalently conjugated to a bactericidal or bacteriostatic agent 
effective against bacteria of Neisseria species, preferably N. meningitidis and A^. 
gonorrhoeae. 

In yet another embodiment of this aspect of the invention are provided 

10 therapeutic reagents and methods for using such reagents wherein said reagents 
comprise recombinant expression constructs of the invention, or a homologue thereof 
that expresses the nucleic acid encoding a hemoglobin receptor in an antisense 
orientation. In a preferred embodiment, the bacterial hemoglobin receptor protein 
gene is isolated from bacteria of Neisseria species. In a particularly preferred 

15 embodiment, the hemoglobin receptor protein gene is isolated from Neisseria 
meningitidis. In particular examples of this embodiment of the invention, the nucleic 
acids comprise a specifically-hybridizing fragment of a 3.3 kilobase (kb) 
5a/nHI///indIII fragment of N. meningitidis, serotype C genomic DNA. In this 
embodiment, the nucleotide sequence comprises all or a specifically-hybridizmg 

20 fragment of an open reading frame of 2376 nucleotides of N. meningitidis, serotype 
C genomic DNA encoding 792 amino acids comprising the hemoglobin receptor 
gene. In this embodiment of the invention, the nucleotide sequence of the N. 
meningitidis, serotype C hemoglobin receptor gene is the sequence depicted in Figure 
2 (SEQ ID No:l). In another example of this embodiment of the invention, the 

25 nucleic acid probes comprise a specifically-hybridizing firagment of a 2373bp, 
polymerase chain reaction-amplified fragment of N. meningitidis, serotype A 
genomic DNA. In this embodiment, the nucleotide sequence comprises all or a 
specifically-hybridizing fragment of an open reading frame of 2370 nucleotides of 
N. meningitidis, serotype A genomic DNA encoding 790 amino acids comprising the 

30 hemoglobin receptor gene. In this embodiment of the invention, the nucleotide 
sequence of the N. meningitidis, serotype A hemoglobm receptor gene is the 
sequence depicted in Figure 7 (SEQ ID No:3). In yet another example of this 
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embodiment of the invention, the nucleic acid probes comprise a specifically- 
hybridizing fragment of a 2376bp, polymerase chain reaction-amplified fragment of 
A^. meningitidis, serotype B genomic DNA. In this embodiment, the nucleotide 
sequence comprises all or a specifically-hybridizing fragment of an open reading 
5 frame of 2373 nucleotides of N. meningitidis, serotype B genomic DNA encoding 
791 amino acids comprising the hemoglobin receptor gene. In this embodiment of 
the invention, the nucleotide sequence of the A'^. meningitidis, serotype B hemoglobin 
receptor gene is the sequence depicted in Figure 8 (SEQ ID No: 5). The invention 
also provides recombinant expression constructs of the invention, or a homologue 

10 thereof that expresses the nucleic acid encoding a hemoglobin receptor in an 
antisense orientation, wherein the nucleic acid encodes a bacterial hemoglobin 
receptor gene isolated from N. gonorrhoeae. In a preferred embodiment of this 
aspect of the invention, the nucleic acid probes comprise a specifically-hybridizing 
fragment of a 2378bp, polymerase chain reaction-amplified fragment of N. 

15 gonorrhoeae genomic DNA. In this embodiment, the nucleotide sequence comprises 
all or a specifically-hybridizing fragment of an open reading frame of 2373 
nucleotides of N. gonorrhoeae genomic DNA encoding 791 amino acids comprising 
the hemoglobin receptor gene. In this embodiment of the invention, the nucleotide 
sequence of the A', gonorrhoeae hemoglobin receptor gene is the sequence depicted 

20 in Figure 9 (SEQ ID No:7). 

The invention also provides a method for screening compounds for their 
ability to inhibit, facilitate or modulate the biochemical activity of a bacterial 
hemoglobin receptor protein of the invention, for use in the in vitro screening of 
novel agonist and antagonist compounds and novel bactericidal and bacteriostatic 

25 agents specific for the hemoglobin receptor protein. In preferred embodiments, cells 
transformed with a recombinant expression construct of the invention are contacted 
with such a compound, and the bindmg capacity of the compounds, as well as the 
effect of the compound on binding of other, known hemoglobin receptor agonists 
such as hemoglobin and hemin, and antagonists, is assayed. Additional preferred 

30 embodiments comprise quantitative analyses of such effects. 

The present invention is also useful for the detection of bactericidal and/or 
bacteriostatic analogues, agonists or antagonists, known or unknown, of a bacterial 
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hemoglobin receptor protein, preferably derived from bacteria of Neisseria species, 
most preferably isolated from N. meningitidis, wherein such compounds are either 
naturally occurring or embodied as a drug. 

The invention also provides vaccines for immunizing a human against 
5 infection with pathogenic bacteria of Neisseria species, the vaccines comprising the 
hemoglobin binding proteins of the invention or antigenic fragments thereof. In a 
preferred embodiment, the vaccines of the invention comprise cells expressing a 
hemoglobin receptor binding protein of the invention, or an antigenic fragment 
thereof, preferably wherein said cells are attenuated varieties of cells adapted for 

10 growth in humans, i.e., wherein such cells are non-pathogenic and do not cause 
bactermia, endotoxemia or sepsis. Examples of such attenuated varieties of cells 
include attenuated strains of Salmonella species, for example Salmonella typhi and 
Salmonella typhimurium, as well as other attenuated bacterial species. Also provided 
by the invention are recombinant expression constructs as disclosed herein useful per 

15 se as vaccines, for introduction into an animal and production of an immunologic 
response to bacterial hemoglobin receptor protein antigens encoded therein. 

Specific preferred embodiments of the present invention will become evident 
from the following more detailed description of certain preferred embodiments and 
the claims. 

20 

DESCRIPTION OF THE DRAWINGS 

The foregoing and other objects of the present invention, the various features 
thereof, as well as the invention itself may be more fully understood from the 
following description, when read together with the accompanying drawings in which: 
25 Figure 1 is a schematic drawing of the restriction enzyme digestion map of 

a N. meningitidis cosmid clone and subclones thereof derived as described in 
Example 2. 

Figure 2 illustrates the nucleotide (SEQ ID No.:l) and deduced amino acid 
(SEQ ID No.: 2) sequences of the N. meningitidis hemoglobin receptor protein 
30 encoded in a 3.3 kb Bamm/Hindm DNA fragment. 

Figure 3 presents a photograph of a stained SDS/ 10% PAGE electrophoresis 
gel showing the results of in vitro expression of the N. meningitidis hemoglobin 
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receptor gene product as an approximately 90 kilodalton protein, and /3-lactamase 
protein having a molecular weight of about 30.0 kilodaltons used as a molecular 
weight marker. 

Figure 4 presents an amino acid sequence comparison between portions of the 
A^. meningitidis transferrin receptor Tbpl (SEQ ID No.:9), the N. meningitidis 
lactoferrin receptor LbpA (SEQ ID No.: 10), and N. meningitidis hemoglobin 
receptor HmbR (SEQ ID No.:2). 

Figure 5 illustrates Southern hybridization analysis of chromosomal DNA 
from A^. meningitidis 8013 and the MCSOUhmbR mutant using a BamKl-Saa 
fragment of the hmb gene as probe labeled using a DIG nonradioactive DNA 
labelling and detection kit (Boehringer Mannheim Biochemicals, Indianapolis, IN). 
Lane 1 contains DNA from//, meningitidis strain MC8013, digested with Clal; lane 
2 is MC803lhmbR DNA digested with Clal; lane 3, is MC8013 DNA digested with 
BamHl and 5a/I; and lane 4 is MC8013/im*i? DNA digested with BamUl and Safl. 

Figure 6 is a graph describing the course of infection using A^. meningitidis 
wild type (MC8013) and HmbR mutant strains in an in vivo rat infant infection 
model. Each strain was injected intraperitoneally (2 x 10* CFU) into three infant 
inbred Lewis rats. The results represent the average of two similarly-performed 
experiments. 

Figure 7 illustrates the nucleotide (SEQ ID No.: 3) and deduced amino acid 
(SEQ ID No.:4) sequences of the N. meningitidis, serotype A hemoglobin receptor 
protein encoded on a 2373bp polymerase chain reaction-amplified DNA fragment. 

Figure 8 illustrates the nucleotide (SEQ ID No.:5) and deduced amino acid 
(SEQ ID No.:6) sequences of the N. meningitidis, serotype B hemoglobin receptor 
protein encoded on a 2376bp polymerase chain reaction-amplified DNA fragment. 

Figure 9 illustrates the nucleotide (SEQ ID No.: 7) and deduced amino acid 
(SEQ ID No.: 8) sequences of the A^. gonorrhoeae hemoglobin receptor protein 
encoded on a 2376bp polymerase chain reaction-amplified DNA fragment. 

Figure 10 represents a schematic of a nucleic acid sequence comparison 
between the hemoglobin receptor proteins derived from A^. meningitidis, serotypes 
A (SEQ ID No.:3), B (SEQ ID No.:5) and C (SEQ ID No.:l) and from N. 
gonorrhoeae (SEQ ID No.:7), wherein the direction of trascription of the genes is 
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in the direction of the arrow, and the following abbreviations refer to restriction 
endonuclease sites: H represents HindlU; N represents Notl; Bg represents Bs 
represents BssRl; Nr represents Nrul; CI represents Clal; P represents Pstl; Sa 
represents Sad; Av represents Aval; B represents BamRl; S represents 5a/I; EV 
represents EcoRV; Sh represents SphI; and Sy represents Styl. 

Figure 11 presents an amino acid sequence comparison between the 
hemoglobin receptor proteins derived from N. meningitidis, serotypes A (SEQ ID 
No.:4), B (SEQ ID No.:6) and C (SEQ ID No.:2) and from N. gonorrhoeae (SEQ 
ID No.:8). 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The term "bacterial hemoglobin receptor" as used herein refers to bacterial 
proteins comprising the outer membrane of Gram negative bacteria, which 
specifically mediate transit of hemoglobin-derived hemin, as well as hemin from 
other sources, through the outer membrane of such bacteria and into the periplasmic 
space. The bacterial hemoglobin receptor proteins of the invention are characterized 
by, first, an amino acid sequence that is essentially the sequence depicted in Figures 
2 (SEQ ID No.:2), 7 (SEQ ID No.:4), 8 (SEQ ID No.:6) and 9 (SEQ ID No.:8). 
The bacterial hemoglobin receptor proteins of the invention are further characterized 
by having substantially the same biological activity as a protein having the amino 
acid sequence depicted in Figures 2 (SEQ ID No. :2), 7 (SEQ ID No. :4), 8 (SEQ ID 
No.:6) and 9 (SEQ ID No.:8). This definition is intended to encompass naturally- 
occurring variants and mutant proteins, as well as genetically engineered variants 
made by man. 

Cloned, isolated and purified nucleic acid provided by the present invention 
may encode a bacterial hemoglobin receptor protein of any Neisseria species of 
origin, including, most preferably, Neisseria meningitidis species and serotypes 
thereof and Neisseria gonorhoeae species. 

The nucleic acid hybridization probes provided by the invention comprise 
DNA or RNA having all or a specifically-hybridizing fragment of the nucleotide 
sequence of the hemoglobin receptor protein as depicted in Figures 2 (SEQ ID 
No.:l), 7 (SEQ ID No.:3), 8 (SEQ ID No.:5) and 9 (SEQ ID No. :7), or any portion 
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thereof effective in nucleic acid hybridization. Mixtures of such nucleic acid 
hybridization probes are also within the scope of this embodiment of the invention. 
Nucleic acid probes as provided herein are useful for detecting the presence of a 
bacteria, inter alia, in a human as the result of an infection, in contaminated 
biological samples and specimens, in foodsmffs and water supplies, or in any 
substance that may come in to contact with the human. Specific hybridization will 
be understood to mean that the nucleic acid probes of the invention are capable of 
forming stable, specific hybridization to bacterially-derived DNA or RNA under 
conditions of high stringency, as the term "high stringency" would be understood by 
those with skill in the art {see, for example, Sambrook et al. , 1989, Molecular 
Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y. and Hames and Higgins, eds., 1985, Nucleic Acid Hybridization . IRL 
Press, Oxford, U.K.). Hybridization will be understood to be accomplished using 
well-established techniques, including but not limited to Southern blot hybridization, 
Northern blot hybridization, in situ hybridization and Southern hybridization to 
polymerase chain reaction product DNAs. The invention will thus be understood to 
provide oligonucleotides, specifically, pairs of oligonucleotides, for use as primers 
in support of in vitro amplification of bacterial hemoglobin receptor genes and 
mRNA transcripts. 

The production of proteins such as bacterial hemoglobin receptor proteins 
from cloned genes by genetic engineering means is well known in this art. The 
discussion which follows is accordingly intended as an overview of this field, and is 
not intended to reflect the full state of the art. It wUl be understood ftx)m the 
following discussion that the hemoglobin receptor protein genes of this invention are 
particularly advantageous, smce expression of such proteins by bacteria, includmg 
non-Neisseria species of bacteria, can complement certain auxotrophic mutants of 
said transformed bacteria otherwise unable to subsist absent supplementation of the 
growth media with iron (HI). 

DNA encoding a bacterial hemoglobin receptor protein, in view of the instant 
disclosure, by chemical synthesis, by screening reverse transcripts of mRNA from 
appropriate cells, by screening genomic libraries from appropriate cells, or by 
combinations of these procedures, as illustrated below. Screening of mRNA or 
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genomic DNA may be carried out with oligonucleotide probes generated from the 
nucleic acid sequence information from the bacterial hemoglobin receptor protein 
disclosed herein. Probes may be labeled with a detectable group such as a 
fluorescent group, a radioactive atom or a chemiluminescent group in accordance 
with know procedures and used in conventional hybridization assays, as described 
in greater detail in the Examples below. In the alternative, bacterial hemoglobin 
receptor protein-encoding nucleic acids may be obtained by use of the polymerase 
chain reaction (PGR) procedure, using appropriate pairs of PGR oligonucleotide 
primers corresponding to nucleic acid sequence information derived from a bacterial 
hemoglobin receptor protein as provided herein. See U.S. Patent Nos. 4,683,195 
to Mullis et al. and 4,683,202 to MuUis, as specifically disclosed herein in Example 
9 below. In another alternative, such bacterial hemoglobin receptor protein-encoding 
nucleic acids may be isolated from auxotrophic cells transformed with a bacterial 
hemoglobin receptor protein gene, thereby relieved of the nutritional requirement for 
uncomplexed iron (III). 

Any bacterial hemoglobin receptor protein of the invention may be 
synthesized in host cells transformed with a recombinant expression construct 
comprising a nucleic acid encoding the bacterial hemoglobin receptor protein. Such 
recombinant expression constructs can also be comprised of a vector that is a 
replicable DNA construct. Vectors are used herein either to amplify DNA encoding 
a bacterial hemoglobin receptor protein and/or to express DNA encoding a bacterial 
hemoglobin receptor protein. For the purposes of this invention, a recombinant 
expression construct is a replicable DNA construct in which a nucleic acid encoding 
a bacterial hemoglobin receptor protein is operably linked to suitable control 
sequences capable of effecting the expression of the bacterial hemoglobin receptor 
protein in a suitable host cell. 

The need for such control sequences will vary depending upon the host cell 
selected and the transformation method chosen. Generally, bacterial control 
sequences include a transcriptional promoter, an optional operator sequence to 
control transcription, a sequence encoding suitable mRNA ribosomal binding sites 
(the Shine-Delgamo sequence), and sequences which control the termination of 
transcription and translation. Amplification vectors do not require expression control 
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domains. All that is needed is the ability to replicate in a host, usually conferred by 
an origin of replication, and a selection gene to facilitate recognition of 
transformants. See, Sambrook et al., 1989, ibid. 

Vectors useftil for practicing the present invention include plasmids and virus- 
derived constructs, including phage and particularly bacteriophage, and integratable 
DNA fragments (i.e., fragments integratable into the host genome by homologous 
recombination). The vector replicates and functions independently of the host 
genome, or may, in some instances, integrate into the genome itself. Suitable 
vectors v/ill contain replicon and control sequences which are derived from species 
compatible with the intended expression host. A preferred vector is pLAFR2 (see 
Riboli et al., 1991, Microb. Pathogen. 10: 393-403). 

Transformed host cells are cells which have been transformed or transfected 
with recombinant expression constructs made using recombinant DNA techniques and 
comprising nucleic acid encoding a bacterial hemoglobin receptor protein. Preferred 
host cells are cells of Neisseria species, particularly A^. meningitidis, as well as 
Salmonella typhi and Salmonella typhimurium species, and Escherichia coli 
auxotrophic mutant cells (hemA aroB). Transformed host cells may express the 
bacterial hemoglobin receptor protein, but host cells transformed for purposes of 
cloning or amplifying nucleic acid hybridization probe DNA need not express the 
receptor protein. When expressed, the bacterial hemoglobin receptor protein of the 
invention will typically be located in the host cell outer membrane. See, Sambrook 
et al., ibid. 

Cultures of bacterial cells, particularly cells of Neisseria species, and certain 
E. coli mutants, are a desirable host for recombinant bacterial hemoglobin receptor 
protein synthesis. In principal, any bacterial cell auxotrophic for uncomplexed iron 
(III) is useful for selectively growing bacterial hemoglobin receptor protein- 
transformed cells. However, for this purpose, well-characterized auxotrophs, such 
as E. coli hemA aroB mutants are preferred. 

The invention provides homogeneous compositions of a bacterial hemoglobin 
receptor protein produced by transformed cells as provided herein. Each such 
homogeneous composition is intended to be comprised of a bacterial hemoglobin 
receptor protein that comprises at least 90% of the protein in such a homogenous 
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composition. The invention also provides membrane preparations from cells 
expressing a bacterial hemoglobin receptor protein as the result of transformation 
with a recombinant expression construct of the invention, as described herein. 

Bacterial hemoglobin receptor proteins, peptide fragments thereof and 
5 membranes derived from cells expressing such proteins in accordance with the 
present invention may be used for the production of vaccines effective against 
bacterial infections in a himian, with pathogenic microorganisms expressing such 
bacterial hemoglobin receptor proteins. Such vaccines preferably would be effective 
in raising an immunological response against bacteria of Neisseria species, most 

10 preferably N. meningitidis and N. gonorhoeae. Also encompassed within the 
vaccines provided by the invention are recombinant expression constructs as 
disclosed herein useful per se as vaccines, for introduction into an animal and 
production of an immunologic response to bacterial hemoglobin receptor protein 
antigens encoded therein. 

15 Preparation of vaccines which contain polypeptide or polynucleotide 

sequences as active ingredients is well understood in the art. Typically, such 
vaccines are prepared as injectables, either as liquid solutions or suspensions. 
However, solid forms suitable for solution in, or suspension in, liquid prior to 
injection may also be prepared. The preparation may also be emulsified. The active 

20 immunogenic ingredient is often mixed with excipients which are pharmaceutically 
acceptable and compatible with the active ingredient. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol, or the like and combinations 
thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants 

25 which enhance the effectiveness of the vaccine. The vaccines are conventionally 
administered parenterally, by injection, for example, either subcutaneously or 
intramuscularly. Additional formulations which are suitable for other modes of 
administration include suppositories and, in some cases, oral formulations. For 
suppositories, traditional binders and carriers may include, for example, polyalkalene 

30 glycols or triglycerides; such suppositories may be formed from mixtures containing 
the active ingredient in the range of 0.5% to 10%, preferably 1 to 2%. Oral 
formulations include such normally employed excipients as, for example, 
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pharmaceutical grades of manitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate and the like. These compositions take 
the form of solutions, suspensions, tablets, pills, capsules, sustained release 
formulations or powders and contain 10% to 95% of active ingredient, preferably 25 
to 70%. 

The polypeptides of the invention may be formulated into the vaccine as 
neutral or salt forms. Pharmaceutically acceptable salts, include the acid additional 
salts (formed with the free amino groups of the peptide) and which are formed with 
inorganic acids such as, for example, hydrochloric or phosphoric acids, or such 
organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with 
the free carboxyl groups may also be derived from inorganic bases such as, for 
example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such 
organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, 
procaine, and the like. 

In another embodiment, such vaccines are provided wherein the bacterial 
hemoglobin receptor proteins or peptide fragments thereof are present in the intact 
cell membranes of cells expressing such proteins in accordance with the present 
invention. In preferred embodiments, cells useful in these embodiments include 
attenuated varieties of cells adapted to growth in humans. Most preferably, said cells 
are attenuated varieties of cells adapted for growth in humans, i.e., wherein such 
cells do not cause frank disease or other pathological conditions, such as bactermia, 
endotoxemia or sepsis. For the purposes of this invention, "attenuated" cells will be 
understood to encompass prokaryotic and eukaryotic cells that do not cause infection, 
disease, septicemia, endotoxic shock, pyrogenic shock, or other serious and adverse 
reactions to administration of vaccines to an animal, most preferably a human, when 
such cells are introduced into the animal, whether such cells are viable, living, heat-, 
chemically- or genetically attenuated or inactivated, or dead. It will be appreciated 
by those with skill in this art that certain minor side-effects of vaccination, such as 
short-term fever, muscle discomfort, general malaise, and other well-known reactions 
to vaccination using a variety of different types of vaccines, can be anticipated as 
accompanying vaccination of an animal, preferably a human, using the vaccines of 
the invention. Such acute, short-term and non-life-threatening side effects are 
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encompassed in the instant definition of the vaccines of the invention, and vaccines 
causing such side-effects fall within the definition of "attenuated" presented herein. 
Preferred examples of such attenuated cells include attenutated varieties of 
Salmonella species, preferably Salmonella typhi and Salmonella typhimurium, as well 
5 as other attenuated bacterial species. It will be specifically understood that these 
embodiments of the vaccines of the invention encompass so-called "live" attenuated 
cell preparations as well as heat- or chemically- inactivated cell preparations. 

In other embodiments of the invention are provided vaccines that are DNA 
vaccines, comprising the nucleic acids of the invention in recombinant expression 

10 constructs competant to direct expression of hemoglobin receptor proteins when 
introduced into an animal. In preferred embodiments, such DNA vaccines comprise 
recombinant expression constructs wherein the hemoglobin receptor-encoding nucleic 
acids of the invention are operably linked to promoter elements, most preferably the 
early gene promoter of cytomegalovirus or the early gene promoter of simian virus 

15 40. DNA vaccines of the invention are preferably administered by intramuscular 
injection, but any appropriate route of administration, including oral, transdermal, 
rectal, nasal, aerosol administration into lung, or any other clinically-acceptable route 
of administration can be used by those with skill in the art. 

In general, the vaccines of the invention are administered in a maimer 

20 compatible with the dosage formulation, and in such amount as will be 
therapeutically effective and immunogenic. The quantity to be administered depends 
on the subject to be treated, capacity of the subject's immune system to synthesize 
antibodies, and the degree of protection desired. Precise amounts of active 
ingredient required to be administered depend on the judgment of the practitioner and 

25 are peculiar to each individual. However, suitable dosage ranges are of the order 
of several hundred micrograms active ingredient per individual. Suitable regimes for 
initial administration and booster shots are also variable, but are typified by an initial 
administration followed in one or two week intervals by a subsequent injection or 
other administration. 

30 The recombinant expression constructs of the present invention are also useful 

in molecular biology to transform bacterial cells which do not ordinarily express a 
hemoglobin receptor protein to thereafter express this receptor. Such cells are 
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useful, inter alia, as intermediates for making cell membrane preparations useful for 
receptor binding activity assays, vaccine production, and the like, and in certain 
embodiments may themselves be used, inter alia, as vaccines or components of 
vaccines, as described above. The recombinant expression constructs of the present 
invention thus provide a method for screening potentially useful bactericidal and 
bacteriostatic drugs at advantageously lower cost than conventional screening 
protocols. While not completely eliminating the need for ultimate in vivo activity 
and toxicology assays, the constructs and cultures of the invention provide an 
important first screening step for the vast number of potentially useful bactericidal 
and bacteriostatic drugs synthesized, discovered or extracted from natural sources 
each year. In addition, such bactericidal or bacteriostatic drugs would be selected 
to utilize a nutritional pathway associated widi infectious virulence in these types of 
bacteria, as disclosed in more detail below, thus selectively targeting bacteria 
associated with the development of serious infections in vivo. 

Also, the invention provides both functional bacterial hemoglobin receptor 
proteins, membranes comprising such proteins, cells expressing such proteins, and 
the amino acid sequences of such proteins. This invention thereby provides sufficient 
structural and functional activity information to enable rational drug design of novel 
therapeutically-active antibacterial drugs using currently-available techniques (see 
Walters, "Computer-Assisted Modeling of Drugs", in Klegerman & Groves, eds., 
1993, Pharmaceutical Biotechnolopv. Interpharm Press: Buffalo Grove, IL, pp. 165- 
174). 

Nucleic acids and oligonucleotides of the present invention are useful as 
diagnostic tools for detecting the existence of a bacterial infection in a human, caused 
by a hemoglobin receptor protein-expressing pathological organism of Neisseria 
species. Such diagnostic reagents comprise nucleic acid hybridization probes of the 
invention and encompass paired oligonucleotide PGR primers, as described above. 
Methods provided by the invention include blot hybridization, in situ hybridization 
and in vitro amplification techniques for detecting the presence of pathogenic bacteria 
in a biological sample. Appropriate biological samples advantageously screened 
using the methods described herein include plasma, serum, lymph, cerebrospinal 
fluid, seminal fluid, mucosal tissue samples, biopsy samples, and other potential sites 
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of bacterial infection. It is also envisioned that the methods of the invention may be 
used to screen water, foodstuffs, pharmaceuticals, and other potential sources of 
infection. 

The invention also provides antibodies that are immunologically reactive to 
5 a bacterial hemoglobin receptor protein or epitopes thereof provided by the 
invention. The antibodies provided by the invention may be raised, using methods 
well known in the art, in animals by inoculation with cells that express a bacterial 
hemoglobin receptor protein or epitopes thereof, cell membranes from such cells, 
whether crude membrane preparations or membranes purified using methods well 
10 known in the art, or purified preparations of proteins, including fiision proteins, 
particularly fusion proteins comprising epitopes of a bacterial hemoglobin receptor 
protein of the invention fused to heterologous proteins and expressed using genetic 
engineering means in bacterial, yeast or eukaryotic cells, said proteins being isolated 
from such cells to varying degrees of homogeneity using conventional biochemical 
15 means. Synthetic peptides made using established synthetic means in vitro and 
optionally conjugated with heterologous sequences of amino acids, are also 
encompassed in these methods to produce the antibodies of the invention. Animals 
that are used for such inoculations include individuals from species comprising cows, 
sheep, pigs, mice, rats, rabbits, hamsters, goats and primates. Preferred animals for 
20 inoculation are rodents (including mice, rats, hamsters) and rabbits. The most 
preferred animal is the mouse. 

Cells that can be used for such inoculations, or for any of the other means 
used in the invention, include any cell that naturally expresses a bacterial hemoglobin 
receptor protein as provided by the invention, or any cell or cell line that expresses 
25 a bacterial hemoglobin receptor protein of the invention, or any epitope thereof, as 
a result of molecular or genetic engineering, or that has been treated to increase the 
expression of an endogenous or heterologous bacterial hemoglobin receptor protein 
by physical, biochemical or genetic means. Preferred cells are E. coli auxotrophic 
mutant hemA aroB cells transformed with a recombinant expression construct of the 
30 invention and grown in media supplemented with hemin or hemoglobin as the sole 
iron (m) source, and cells of Neisseria species. 
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The present invention also provides monoclonal antibodies that are 
immunologically reactive with an epitope of a bacterial hemoglobin receptor protein 
of the invention, or fragment thereof, present on the surface of such cells, preferably 
E. coli cells. Such antibodies are made using methods and techniques well known 
to those of skill in the art. Monoclonal antibodies provided by the present invention 
are produced by hybridoma cell lines, that are also provided by the invention and 
that are made by methods well known in the art {see Harlow and Lane, 1988, 
Antibodies: A Laboratorv Manual . Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y.). 

Hybridoma cell lines are made by fusing individual cells of a myeloma cell 
line with spleen cells derived from animals immunized with a homogeneous 
preparation of a bacterial hemoglobin receptor protein, membranes comprised 
thereof, cells expressing such protein, or epitopes of a bacterial hemoglobin receptor 
protein, used per se or comprising a heterologous or fusion protein construct, as 
described above. The myeloma cell lines used in the invention include lines derived 
from myelomas of mice, rats, hamsters, primates and humans. Preferred myeloma 
cell lines are from mouse, and the most preferred mouse myeloma cell line is 
P3X63-Ag8.653. The animals from whom spleens are obtained after immunization 
are rats, mice and hamsters, preferably mice, most preferably Balb/c mice. Spleen 
cells and myeloma cells are fiised using a number of methods well known in the art, 
including but not limited to incubation with inactivated Sendai virus and incubation 
in the presence of polyethylene glycol (PEG). The most preferred method for cell 
fusion is incubation in the presence of a solution of 45% (w/v) PEG- 1450. 
Monoclonal antibodies produced by hybridoma cell lines can be harvested from cell 
culture supernatant fluids from in vitro cell growth; alternatively, hybridoma cells 
can be injected subcutaneously and/or into the peritoneal cavity of an animal, most 
preferably a mouse, and the monoclonal antibodies obtained from blood and/or 
ascites fluid. 

Monoclonal antibodies provided by the present invention are also produced 
by recombmant genetic methods well known to those of skill in the art, and the 
present invention encompasses antibodies made by such methods that are 
immunologically reactive with an epitope of a bacterial hemoglobin receptor protein 
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of the invention. The present invention also encompasses fragments, including but 
not limited to F(ab) and F(ab)2 fragments, of such antibody. Fragments are 
produced by any number of methods, including but not limited to proteolytic 
cleavage, chemical synthesis or preparation of such fragments by means of genetic 
engineering technology. The present invention also encompasses single-chain 
antibodies that are immunologically reactive with an epitope of a bacterial 
hemoglobin receptor protein, made by methods known to those of skill m the art. 

The antibodies and fragments used herein can be labeled preferably with 
radioactive labels, by a variety of techniques. For example, the biologically active 
molecules can also be labeled with a radionucleotide via conjugation with the cyclic 
anhydride of diethylenetriamine penta-acetic acid (DPTA) or bromoacetyl 
aminobenzyl ethylamine diamine tetra-acidic acid (BABE). See Hnatowich et al. 
(1983, Science 22^: 613-615) and Meares et al. (1984, Anal. Biochem. 142: 68-78, 
both references incorporated by reference) for further description of labeling 
techniques. 

The present invention also encompasses an epitope of a bacterial hemoglobin 
receptor protein of the invention, comprised of sequences and/or a conformation of 
sequences present in the receptor molecule. This epitope may be naturally occurring, 
or may be the result of proteolytic cleavage of a receptor molecule and isolation of 
an epitope-containing peptide or may be obtained by synthesis of an epitope- 
containing peptide using methods well known to those skilled m the art. The present 
invention also encompasses epitope peptides produced as a result of genetic 
engineering technology and synthesized by genetically engineered prokaryotic or 
eukaryotic cells. 

The invention also includes chimeric antibodies, comprised of light chain and 
heavy chain peptides immunologically reactive to a bacterial hemoglobin receptor 
protein-derived epitope. The chimeric antibodies embodied in the present invention 
include those that are derived from naturally occurring antibodies as well as chimeric 
antibodies made by means of genetic engineering technology well known to those of 
skill in the art. 

Also provided by the present invention are diagnostic and therapeutic methods 
of detecting and treating an infection in a human, by a pathogenic organisms 
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expressing a bacterial hemoglobin receptor protein. Diagnostic reagents for use in 
such methods include the antibodies, most preferably monoclonal antibodies, of the 
invention. Such antibodies are used in conventional immunological techniques, 
including but not limited to enzyme-linked immunosorbent assay (ELISA), 
radioimmune assay (RIA), Western blot assay, immunological titration assays, 
immunological diffusion assays (such as the Ouchterlony assay), and others known 
to those of skill in the art. Also provided are epitopes derived from a bacterial 
hemoglobin receptor protein of the invention and immunologically cross-reactive to 
said antibodies, for use in any of the immunological techniques described herein. 

Additional diagnostic assays include nucleic acid hybridization assays, using 
the nucleic acids of the invention or specifically-hybridizing fragments thereof, for 
sensitive detection of bacterial genomic DNA and/or mRNA. Such assays include 
various blot assays, such as Southern blots. Northern blots, dot blots, slot blots and 
the like, as well as in vitro amplification assays, such as the polymerase chain 
reaction assay (PCR), reverse transcriptase-polymerase chain reaction assay (RT- 
PCR), ligase chain reaction assay (LCR), and others known to those skilled in the 
art. Specific restriction endonuclease digestion of diagnostic fragments detected 
using any of the methods of the invention, analogous to restriction fragment linked 
polymorphism assays (RFLP) are also within the scope of this invention. 

The invention also provides therapeutic methods and reagents for use in 
treating infections in a human, cause by a microorganism expressing a bacterial 
hemoglobin receptor protein of the invention, most preferably a bacteria of Neisseria 
species. Therapeutic reagents for use in such methods include the antibodies, most 
preferably monoclonal antibodies, of the invention, either per se or conjugated to 
bactericidal or bacteriostatic drugs or other antibiotic compoimds effective against the 
infectious microorganism. In such embodiments, the antibodies of the invention 
comprise pharmaceutical compositions, additionally comprising appropriate 
pharmaceutically-acceptable carriers and adjuvants or other ancillary components 
where necessary. Suitable carriers are, for example, water, saline, dextrose, 
glycerol, ethanol, or the like and combinations thereof. In addition, if desired, the 
pharmaceutical formulation may contain minor amounts of auxiliary substances such 
as wetting or emulsifying agents, pH buffering agents, or other compounds which 
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enhance the effectiveness of the antibody. In these embodiments, it will be 
understood that the therapeutic agents of the invention serve to target the infectious 
bacteria, either by immunologically "tagging" the bacteria with an antibody of the 
invention for recognition by cytotoxic cells of a human's immune system, or by 
5 specifically delivering an antimicrobial drug to the infectious microorganism via the 
bacterial hemoglobin receptor protein. 

Additional therapeutic reagents include the nucleic acids of the invention or 
fragments thereof, specifically antisense embodiments of such nucleic acids. Such 
antisense nucleic acids may be used themselves or embodied in a recombinant 
10 expression construct specific for antisense expression, wherein said construct is 
genetically engineered to co-opt a portion of the genome of a bacterial virus, 
preferably a bacteriophage, infectious for the bacterial pathogen responsible for the 
infection. In these embodunents, introduction of the antisense nucleic acids of the 
invention into the bacterial cell inhibits, attentuates or abolishes expression of the 
15 bacterial hemoglobin receptor, thereby reducing the virulence of the bacterial 
infection and enabling more effective antibacterial interventions. In additional 
embodiments, bacteriophage are provided bearing "knockout" copies of a bacterial 
hemoglobin receptor gene, whereby the phage achieves genetic mutation of the 
endogenous hemoglobin receptor gene in the infectious bacteria via, for example, 
20 homologous recombination of the exogenous knockout copy of the bacterial 
hemoglobin receptor gene with the endogenous hemoglobin receptor gene in the 
infectious microorganism. 

The Examples which follow are illustrative of specific embodiments of the 
invention, and various uses thereof. They set forth for explanatory purposes only, 
25 and are not to be taken as limiting the invention. 

EXAMPLE 1 
Plasmids. bacteria, and media 
Plasmids and bacteria used herein are listed on Table 1. E. coli strains were 
30 routinely grown in Luria-Bertani (LB) broth supplemented with 5-aminolevulinic acid 
and 50mg/L hemin chloride as necessary. A^. meningitidis 8013 is a serogroup C 
clinical isolate (Nassif «f a/. , 1993, Mol. Microbiol. 8: 719-725). The meningococci 
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were routinely grown on GCB agar (Difco) supplemented as described by Kellogg 
et al. (1963, J. Bacterial 85: 1274-1279), and incubated at 'hTC under a 5% COj 
atmosphere. Transformation of meningococci was performed as described by Nassif 
et al. (1992, Mol. Microbiol. 6: 591-597). When necessary, the following antibiotics 
5 were used with E. colt: rifampicin, 100 mg/L; tetracycline, 15 mg/L; kanamycin, 
30 mg/L; chloramphenicol, 20 mg/L; carbenicillin, 100 mg/L. For Neisseriae, 
kanamycin at 100 mg/L was used when needed. 

EXAMPLE 2 

10 Auxotroph Complementation Cloning of a hemoglobin Receptor Gene from 

Neisseria meninntidis 

In order to identify N. meningitidis outer membrane receptor(s) involved in 

the uptake of haemin and/or haemoglobin iron, an auxotroph complementation 

cloning strategy was used, similar to the approach previously taken to identify the 

15 Y. enterocolitica and V. cholerae hemin receptors {see Stojiljkovic and Hantke, 1992, 
EMBO J. U: 4359-4367; Henderson and Payne, 1994, J. Bacterial. 176: 3269- 
3277). This strategy is based on the fact that the outer membrane of Gram-negative 
bacteria is impermeable to hemin (McConville and Charles, 1979, /. Microbiol. 113: 
165-168) and therefore E. coli porphyrin biosynthesis mutants cannot grow on 

20 exogenously supplied hemin. If provided with the N. meningitidis outer membrane 
hemin receptor gene, the E. coli porphyrin mutant would be able to use exogenously 
supplied hemin as its porphyrin source. 

A cosmid bank of N. meningitidis 8013 clone 6 DNA was prepared using 
conventional cosmid cloning methodologies (Sambrook et al., 1989, ibid.). N. 

25 meningitidis bacterial DNA was partially digested by Mbol, size fractionated on 
sucrose gradients and cloned into the fiamHI site of the cosmid vector pLAFR2 
(Riboli et al., 1991, Microb. Pathogen. 10: 393-403). This cosmid bank was 
mobilized into the E. coli hemA aroB Rif recipient strain by triparental matings 
using a conjugal plasmid pRK2013::Tn9. The mating mixture was plated onselective 

30 plates containing hemin chloride (50mg/L), 0.1 mM 2,2'-dypyridil and rifampicin 
(100 mg/L). Several clones growing on exogenously supplied haemin were isolated 
after an overnight incubation. 



-32 - 



wo 96/12020 



PCT/US95/13623 



STRAIN 

E. coli K12 

EB53 

KP1041 

H1388 

TSM348 

IR754 

IR736 

DH5a 

N. meningitidis 
ATCC 13077 

MC8013 
MChmbR 

N. gonorrhoeae MSllA 




GENOTYPE 

hemA, aroB, rpoB 

MCAmtonB::Knf 

exbB::JnlO Mac pro 

endA, hsdR, pro, supF, pRK2013::Tii9 

EB53, tonB::Knf 

EB53, exbB::TnlO 

recA, gyrB 

Serotype A 
Serotype B* 
clone 6, wild type 
hmbR::aphA-3 



pA15 replicon, chloramphenicor 

pLAFR2, hemoglobin-utilizing cosmid 

pLAFR2, hemin-utilizing cosmid 

6kb Clal, pSUSK 

3kb BamHl/San, pUC19 

1.2kb aphA-3, in Notl site of pIRS523 

4kb BammiClal, pBluescript 

0.7kb Notl/BamHl, pBluescript 

3.3kb Bamm/Hindm, SU(SK) 



' Laboratory collection 
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The hemin utilization phenotype of these transformants was tested by re- 
introduction of the cosmids into naive E. coli hetnA aroB cells and by monitoring the 
growth on hemin-supplemented plates. The ability of E. coli strains to utilize heme 
or hemoglobin as the sole iron source was tested as previously described (Stojiljkovic 
and Hantke, 1992, ibid.). Cells were grown on LB agar supplemented with 50;iM 
deferoxamine mesylate (an iron chelating agent, obtained from Sigma Chemical Co. , 
St. Louis, MO). Filter discs (1/4 inches, Schleichner & Schuell, Inc., Keene, NH.) 
impregnated with the test compounds (20 /zL of 5 mg/ml stock solutions unless 
otherwise stated) were placed on these plates. After overnight growth at ^TC with 
5 % CO2, zones of growth around the discs were monitored. The iron-bound proteins 
tested in this assay (all obtained from Sigma Chemicals Co.) were hemoglobin from 
human, baboon, bovine and mouse sources, bovine hemin, human lactoferrin (90% 
iron saturated), and human transferrin (90% iron saturated, obtained from Boehringer 
Mannheim Biochemicals, Indianapolis, IN). A total of six hemin utilization positive 
cosmids were obtained using this protocol. Results using such assays are shown in 
Table II. 

EXAMPLE 3 

Restriction Enzyme Digestion Mapping of Hemin Utilization 
Positive CosmlAs 

Cosmid DNA from six hemin-utilization positive cosmids obtained as 
described in Example 2 were digested with Clal, and the resulting fragments were 
cloned into Ctol-digested pSU(SK) vector (obtained from Stratagene, LaJolla, CA). 
One subclone, containing a 6 kb Clal fragment from cosmid cos22 (the resultant 
plasmid was designated pIRS508), was determined to allow utilization of hemin and 
hemoglobin by E. coli hemA aroB assayed as described in Example 2. Another such 
clone, containing an 11 kb Clal fragment from cos44 was also determined to allow 
hemin utilization in these auxotrophic mutant cells. Restriction analysis and Southern 
hybridization indicated that the DNA fragments originating from cos22 and cos44 are 
unrelated. 

The deduced restriction enzyme digestion map of cosmid clone pIRS508 is 
shown in Figure 1. Plasmid pIRS508 enabled E. coli hemA aroB to use both hemin 
and bovine hemoglobin as iron sources although growth on hemoglobin was 

-34- 



wo 96/12020 



PCT/US95/13623 



somewhat weaker than on hemin (Table II). Further subcloning localized the 
hemin/hemoglobin utilization locus to the BamHVHindlll fragment of the insert. In 
addition to sequences encoding the hemoglobin receptor gene (designated hmbR), 
sequences for a Neisseria insertion element (LSI 106) and a portion of a Neisseria 
5 small repetitive element (IR/) are also represented in the Figure. 

EXAMPLE 4 

Nudeotide Sequence Analysis of a Cosmid Clone Encoding 
a Neisseria Hemogiobin Receptor Gene 

10 The nucleotide sequence of the 3.3 kb BamHl-Hindlll DNA fragment 

carrying the hmbR gene and its promoter region was determined using the dideoxy 
chain termination method using a Sequenase 2.0 kit (obtained from U.S. 
Biochemicals, Cleveland, OH) and analyzed using a BioRad electrophoresis system, 
an AutoRead kit (obtained from Pharmacia, Uppsala, SE) and an ALF-370 automatic 

15 sequenator (Pharmacia, Uppsala, Sweden). Plasmid subclones for sequencing were 
produced by a nested deletion approach using Erase-a-Base kit (obtained from 
Promega Biotech, Madison, WI) using different restriction sites in the hmbR gene. 
The nucleotide and predicted amino acid sequences of the hmbR gene are shown in 
Figure 2 

20 An open reading frame (ORF) encoding the N. meningitidis, serotype C 

hemoglobin receptor protein begins at position 470 of the sequence and encodes a 
protein having an amino acid sequence of 792 amino acids, with a calculated 
molecular weight of 85.5 kDa. A Shine-Delgamo sequence (SD) is found at position 
460. The HmbR receptor protein contains a signal peptidase I recognition sequence 

25 at residues 22 to 24 of the protein (underlined) , consistent with the fact that it is an 
outer membrane protein. 

A typical Fur binding nucleotide sequence (designated "Fur box") was found 
in the promoter region of the hmbR gene (Figure 2). Like hemin utilization in 
Yersiniae and Vibrio, hemin and hemoglobin utilizati n in Neisseria are known to be 

30 iron-inducible phenotypes (West and Sparling, 1985, Infect. Immun. 47: 388-394; 
Dyer et al, 1987, Infect. Immun. 55: 2171-2175). In Gram-negative bacteria, 
conditional expression of many iron utilization genes is regulated by the Fur 
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repressor, which recognizes a 19 bp imperfect dyad repeat (Fur-box) in the promoter 
regions of Fur-repressed genes. Recently, a genetic screen (FURTA) for the 
identification of Fur-regulated genes from different Gram-negative bacteria was 
described (Stojiljkovic etal, 1994, J. Mol. Biol. 236: 531-545), and this assay was 
5 used to test whether hmbR expression was controlled in this way. Briefly, a plasmid 
carrying a Fur-box sequence is transformed into an E. coli strain (H1717) which 
possesses a Fur-regulated lac fusion in the chromosome. Expression of this Fur- 
regulated lac fusion is normally repressed. Introduction of a multicopy Fur-box 
sequence on the plasmid titrates the available Fur repressor thus allowing expression 

10 of the Fur-regulated lac fusion (this phenotype is termed FURTA positive). Using 
this screen, the smallest insert fragment from cosmid pIRS508 that produced a 
FURTA positive result was a 0.7 kb BamHl-Notl DNA fragment carried on plasmid 
pIRS528 (see Figure 1). This result indicated that the 0.7 kb Bamm-Notl fragment 
carries a Fur-box and that gene expression from the hmbR promoter is controlled by 

15 a fur-type operon. 

N. meningitidis, serotype C hemoglobin receptor protein was expressed in 
vitro using an E. coli S30 extract system from Promega Biotech (Madison, WI). The 
3.3 kb BamHl-Hindlll fragment, expressed in vitro, encoded a 90kDa protein which 
corresponds in size to the predicted molecular weight of the unprocessed HmbR 

20 receptor. SDS/ 10% PAGE analysis showing the observed of 90K is shown in 
Figure 3. 

Immediately downstream of the hmbR gene (at positions 2955 to 3000 bp in 
Figure 2) was found a short nucleotide sequence that is 99% identical to the flanking 
sequence of the PHI gene of N. gonorrhoeae (Gotschlich et al. , 1987, J. Exp. Med. 

25 165: 471-482). The first 26 bp of this sequence represents one half of the inverted 
repeat (IRl) of the N. gonorrhoeae small repetitive element. This element is found 
in approximately 20 copies in both N. gonorrhoeae and N. meningitidis (Correia et 
al., 1988, J. Biol. Chem. 263: 12194-12198). The analysis of the nucleotide 
sequence fix)m position 3027 to the Clal (3984) restriction site (only the nucleotide 

30 sequence from BamHl (1) to HindUI (3370) is shown in Figure 2) indicated the 
presence of an IS1106 element (Knight et al. , 1992, Mol. Microbiol. 6: 1565-1573). 
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Interestingly, no nucleotide sequence similar to the IS 11 06 inverted repeat was found 
between the IRl element and the beginning of the homology to IS1106. 

These results were consistent with the cloning and identification of a novel 
hemoglobin receptor protein gene from N. meningitidis, embodied in a 3.3kb 
5 BamUl/Hindni fragment of N. meningitidis genomic DNA. 

EXAMPLES 

Amino Acid Sequence Comparison of the N. meningitidis 
Hemoglobin Receptor Protein and Neisseria 
10 Lactoferrin and Transferrin Receptor Proteins 

A comparison of the transferrin (Tbpl; Legrain et al, 1993, Gene 130: 81- 

90), lactoferrin (LbpA; Pettersson et al, 1993, Infect. Immun. 61: 4724-4733, and 

1994, J. Bacterial. 176: 1764-1766) and hemoglobin receptors (HmbR) from N. 

meningitidis is shown in Figure 4. The comparison was done with the CLASTAL 

15 program from the PC/GENE program package (Intelligenetics, Palo Alto, CA). 
Only the amino-terminal and carboxyl terminal segments of the proteins are shown. 
An asterisk indicates identity and a point indicates similarity at the amino acid level. 
Lactoferrin and transferrin receptors were found to share 44.4% identity in amino 
acid sequence. In contrast, homology between these proteins and the hemoglobin 

20 receptor disclosed herein was found to be significantly weaker (22% amino acid 
sequence identity with lactoferrin and 21% with transferrin receptor). 

EXAMPLE 6 

TonB/ExbBD-Dependence of Hemin Transport by the A^. meningitidis 
25 Hemoglobin Receptor 

It was known that the transport of iron-containing siderophores, some colicins 

and vitamin B12 across the outer membrane of E. coli depends on three cytoplasmic 

membrane proteins: TonB, ExbB and ExbD (Posde 1990, Mol. Microbiol. 133: 891- 

898; Braun and Hantke, 1991, in Winkelmann, (ed.). Handbook of Microbial Iron 

30 Chelates . CRC Press, Boca Raton, Fla. , pp. 107-138). In Yersinia and Hemophilus, 

hemin uptake was shown to be a TonB-dependent process (Stojiljkovic and Hantke, 

1992, ibid.; Jarosik et al, 1994, Infect. Immun. 62: 2470-2477). Through direct 

interaction between the outer membrane receptors and the TonB cytoplasmic 
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machinery, the substrate bound to the receptor is internalized into the periplasm 
(Heller et al., 1988, Gene 64: 147-153; Schoffler and Braun, 1989, Molec. Gen. 
Genet. 217: 378-383). This direct interaction has been associated with a particular 
amino acid sequence in membrane proteins associated with the TonB machinery. 
5 All TonB-dependent receptors m Gram-negative bacteria contain several 

regions of high homology in their primary strucmres (Lundrigan and Kadner, 1986, 
J. Biol. Chem. 261: 10797-10801). In the amino acid sequence comparison 
described in Example 5, putative TonB-boxes of all three proteins are underlined. 
The carboxyl terminal end of the HmbR receptor contains the highly conserved 
10 terminal phenylalanine and position 782 arginine residues thought to be part of an 
outer membrane localization signal (Struyve et al. , 1991, /. Mol. Biol. 218: 141-148; 
Koebnik, 1993, Trends Microbiol. 1: 201). At residue 6 of the mature HmbR 
protein, an amino acid sequence - ETTPVKA - is similar in sequence to the so called 
TonB-boxes of several Gram-negative receptors (Heller et al., 1988, ibid.). 

15 Interestingly, the putative TonB-box of HmbR has more homology to the TonB-box 
of the N. gonorrhoeae transferrin receptor (Comelissen et al. , 1992, Bacterial. 
174: 5788-5797) than to the TonB-boxes of £. coli siderophore receptors. When the 
sequence of the HmbR receptor was compared with other TonB-dependent receptors, 
the highest similarity was found with Y. enterocolitica HemR receptor although the 

20 similarity was not as high as to the Neisseria receptors. 

In order to prove the TonB-dependent namre of the A^. meningitidis, serotype 
C hemoglobin receptor, hmbR was introduced into e^B and tonB mutants of E. coli 
EB53, and the ability of the strains to utilize hemin and hemoglobin as porphyrin and 
iron sources was assessed. In these assays, both mutants of E. coli EB53 were 

25 imable to use hemin either as a porphyrin source or as an iron source in the presence 
of a fiinctional hmbR (Table 2). The usage of hemoglobin as an iron source was also 
affected (Table 2). These results are consistent with the notion that the hmbR gene 
product, the N. meningitidis hemoglobin receptor protein of the invention, is TonB- 
dependent, smce expression of this gene in TonB wild type E. coli supported the use 

30 of hemin and hemoglobin as sole iron source in the experiments disclosed in 
Example 2. 



-39- 



96/12020 



PCTAJS95/13623 



EXAMPLE? 

Functional Demonstration that the hmbR Gene Product is the 
Hemoglobin Recep tor Protein in N. menin^tidix 

As shown in the data presented in Table II, hmbR mediated both hemin and 

hemoglobin utilization when expressed in E. coli, but hemoglobin utilization was less 

vigorous than hemin utilization. To determine if the HmbR receptor has the same 

specificity in//, meningitidis, hmbR was inactivated with a 1.2kb kanamycin cassette 

{aphA-3; Nassif et al, 1991, ibid.) and transformed into wild-type N. meningitidis 

8013 clone 6 (serotype C) cells. The inactivation of the chromosomal hmbR copy 

of the Km-resistant transformants was confirmed by Southern hybridization, as 

shown in Figure 5. As can be seen from Figure 5, wild-type N. meningitidis 

genomic DNA contains only one copy of the hmbR gene (lanes 1 and 3). In the Km' 

transformants, the size of the DNA ft-agments containing the wild-type gene has 

increased by 1.2 kb, which is the size of the Kan cassette (Figure 5, lanes 2 and 4). 

When tested for its ability to utilize different iron-containing compounds, these 

mutant cells were found to be unable to use hemoglobin-bound iron, regardless of 

the source (human, bovine, baboon, mouse). The ability of the mutant to utilize 

hemoglobin-haptoglobin was not tested because the wild-type N. meningitidis strain 

is unable to use haptoglobin-haemoglobin complex as an iron source. However, the 

mutant was still able to use hemin iron, lactoferrin- and transferrin-bound iron as 

well as citrate-iron (Table II). As the iron-containing component of hemoglobin is 

hemin, a hemoglobin receptor would be expected to be capable of transporting hemin 

into the periplasm. Indeed, the cloning strategy disclosed herein depended on the 

ability of the cloned meningococcal receptor to transport hemin into the periplasm 

of E. coli. These results strongly suggest that N. meningitidis has at least two 

functional receptors that are involved in the internalization of hemin-containing 

compounds. One is the hemoglobin receptor described herein, which allows the 

utilization of both hemin and hemoglobin as iron sources. The other putative 

receptor in N. meningitidis is a hemin receptor which allows utilization of only 

hemin. This schema is also consistent with the isolation of several cosmid clones 

that allow E. coli EB53 to utilize hemin. DNAs from these cosmids do not hybridize 

with our hmbR probe , indicating that these clones encode a strucnirally-distinct 
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receptor protein capable of transporting hemin into the periplasm of A^. meningitidis 
cells. 

EXAMPLES 

5 Attenuation of Virulence in hmbR Mutant N. meningitidis Cells In Vivo 

In order to test the importance of hemoglobin and hemin scavenging systems 
of N. meningitidis in vivo, the hmbR -mutant and the wild type strain of N. 
meningitidis, serotype C were inoculated into 5 day old infant rats and the numbers 
of bacteria recovered from blood and cerebrospinal fluid were followed. In these 

10 experiments, the method for the assessing N. meningitidis, serotype C virulence 
potential was essentially die same as described by Nassif et al. (1992, ibid.) using 
infant inbred Lewis rats (Charles River, Saint Aubin les Elbeufs, France). Inbred 
rats were used to minimize individual variations. Briefly, the 8013 strain was 
reactivated by 3 animal passages. After the third passage, bacteria were kept frozen 

15 in aliquots at -80° C. To avoid the possibility that modifications m the course of 
infection could result from selection of one spontaneous avirulent variant, one aliquot 
from the animal-passed frozen stock of 8013 was transformed with chromosomal 
DNA from the hmbR mutant, the resultant Kan' transformants were pooled without 
further purification and kept frozen at -80°C. For each experiment, all infant rats 

20 were from the same litter. N. meningitidis 8013 was grown overnight and 2 X 10' 
bacteria injected intraperitoneally into the infant rat. Three rats were used for each 
meningococcal strain. The course of infection was followed over a 24 hours time 
period with blood collected at the indicated times. At the 24 h tune period, the rats 
were sacrificed, the cerebrospinal fluid (CSF) collected and the number of colony- 

25 forming units (CFU) determined. Each experiment was performed in replicate; 
sunilar results were obtained both times. 

The results of these experiments are shown in Figure 6. The hmbR strain, 
which is unable to use hemoglobin as an iron source, was recovered from the blood 
of infected animals in significantly lower numbers when compared with the wild type 

30 strain. Both the mutant and the wild type strain were still able to cross the blood- 
brain barrier as indicated by the isolation of bacteria from the cerebrospinal fluid. 
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These results indicate that hemoglobin represents an important iron source for A^. 
meningitidis during growth in vivo. 

EXAMPLE 9 

Polymerase Chain Reaction Amplification of Hemoglobin Receptor 
Genes from N. meninntid is Serotypes and N. ponnrrhnpnt> 

From the nucleotide sequence of the 3.3 kb BamlU-Hindm DNA fragment 
carrying the hmbR gene and its promoter region was determined specific 
oligonucleotide promers for in vitro amplification of the homologous hemoglobin 
receptor protein genes from N. meningitidis serotypes A and B and N. gonorrhoeae 
MSllA as follows. 

The following oligonucleotide primers were developed for in vitro 
amplificaiton reactions using the polymerase chain reaction (PGR; Saiki et al. , 1988, 
Science 230: 1350-1354): 

5 '-AAACAGGTCTCGGCATAG-3 ' (sense primer) (SEQ ID No. : 1 1) 

5 '-CGCGAATTCAAACAGGTCTCGGCATAG-3 ' (SEQ ID No. : 12) 

(antisense primer) 

for amplifying the hemoglobm receptor protein from N. meningitidis, serotype A; 
5'-CGCGAATTCAAAAACTTCCATTCCAGCGATACG-3' (SEQ ID No. : 13) 
(sense primer) 

5'-TAAAACTTCCATTCCAGCGATACG-3' (antisense primer) (SEQ ID No.: 14) 
for amplifying the hemoglobin receptor protein from A^. meningitidis, serotype B; 
5 '-AAACAGGTCTCGGCATAG-3 ' (sense primer) (SEQ ID No. : 15) 

or 

5'-CGCGAATTCAAACAGGTCTCGGCATAG-3' (SEQ ID No.:16) 

(sense primer) 

and 

5'-CGCGAATTCAAAAACTTCCATTCCAGCGATACG-3' (SEQ ID No.:17) 
(antisense primer) 

or 

5'-TAAAACTTCCATTCCAGCGATACG-3' (antisense primer) (SEQ ID No.: 18) 
for amplifying the hemoglobin receptor protein firom A^. gonorrhoeae MSllA. 

Genomic DNA from N. meningitidis serotype A or B or N. gonorrhoeae 
species was prepared using standard techniques {see Sambrook, et al., ibid.), 
including enzymatic degradation of bacterial cell walls, protoplast lysis, protease and 
RNase digestion, extraction with organic solvents such as phenol and/or chloroform. 
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and ethanol precipitation. Crude DNA preparations were also used. An amount 
(typically, about 0.1/ig) of genomic DNA was used for each amplification reaction. 
A PGR amplification reaction consisted of Pfu polymerase (Stratagene, LaJolla, CA) 
and/or Taq polymerase (Boehringer Mannheim, Germany) in the appropriate buffer 
including about 20picomoles of each amplificaiton primer and 200nanomoles of each 
deoxynucleoside triphosphate. Amplification reactions were performed according to 
the following scheme: 

First cycle 5 min at 95 °C 

2 mm at 51 "C 
6 min at 72 °C 

Cycles 2-13 45 sec at 95 °C 

35 sec at 49 °C 
10 min at 72 °C 



Cycles 14-30 25 sec at 95 "C 

35 sec at 47 "C 
10 min at 72''C 

Upon completion of the amplification reaction, DNA fragments were cloned either 
blunt-ended or, after EcdRi. digestion, into EcdRl digested pSUKS or pWKS30 
vectors and transformed into bacteria. Positively-selected clones were then analyzed 
for the presence of recombinant inserts, which were sequenced as described above 
in Example 4. 

As a result of these experiments, three clones encoding the hemoglobin 
receptor genes from N. meningitidis serotypes A and B and N. gonorrhoeae MS 11 A 
were cloned and the sequence of these genes determined. The nucleic acid sequence 
for each of these genes are shown in Figures 7 (A^. meningitidis, serotype A), 8 (N. 
meningitidis, serotype A) and 9 {N. gonorrhoeae MSllA). 

The degree of homology between the cloned hemoglobin receptors from the 
different N. meningitidis serotypes and A^. gonorrhoeae MSllA was assessed by 
nucleic acid and amino acid sequence comparison, as described in Example 5 above. 
The results of these comparisons are shown in Figures 10 and 11, respectively. 
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Hemoglobin receptor genes from the three A', meningitidis serotypes and A^. 
gonorrhoeae MSllA were found to be from 86.5% to 93.4% homologous; the most 
homologous nucleic acids were A', meningitidis serotypes B and C, and the most 
divergent nucleic acids were N. meningitidis serotype B and A^. gonorrhoeae MSI lA 
5 (Figure 10 and Table ffl). Hemoglobin receptor proteins from all four Neisseria 
species showed a high degree of homology to the other members of the group, 
ranging from 87% homology between the hemoglobin receptor proteins from A^. 
gonorrhoeae MS 11 A and A^. meningitidis serotype B to 93% homology between 
hemoglobin receptor proteins from A^. meningitidis serotypes A and B (Figure 11). 
10 In this comparison, all four receptors were found to share 84.7% amino acid 
sequence identity, and up to 11.6% sequence similarity (i.e., chemically-related 
amino acid residues at homologous sites within the amino acid sequence). The non- 
conserved amino acids were found clustered in the regions of the amino acid 
sequence corresponding to the external loops in the predicted topographical structure 
15 of the hemoglobin receptor proteins. 

It should be understood that the foregoing disclosure emphasizes certain 
specific embodiments of the invention and that all modifications or alternatives 
equivalent thereto are within the spirit and scope of the invention as set forth in the 
appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

Itl ^^^^^^ Sciences University 

!?; cT^^^iii^j,''- ^^-^ -^^^ 

(D) STATE: Oregon 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 97201-3098 

(G) TELEPHONE: 503-494-8200 

(H) TELEFAX: (503 ) -494 -4729 

(ii) TITLE^OF^INVENTION: A Novel Bacterial Hemoglobin Receptor 
(iii) NUMBER OF SEQUENCES: 18 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 (EPO) 
(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US95/ 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



x) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2373 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATG AAA CCA TTA CAA ATG CTC CCT ATC GCC GCG CTG GTC GGC AGT ATT 
Met Lys Pro Leu Gin Met Leu Pro He Ala Ala Leu Val ter Jle 
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TTC GGC AAT CCG GTC TTG GCA GCA GAT GAA GCT GCA ACT GAA ACC ACA 
Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG ATA AAA GCA GTG CGC GTT AAA GGT CAG CGC AAT 
Pro Val Lys Ala Glu lie Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg lie Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAT AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 
Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAC AGC GGC CGC CAT CAA AAA GGC TTT GCT GTT CGC GGC GTG 
Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGC GTG AGC ATA GAC GGT GTA AAC CTG CCT GAT 
Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT TTG TCT ATC GAC CCC GAA CTC GTA CGC AAT ATT GAA ATC GTG AAG 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Glu He Val Lys 
130 135 140 

GGC GCA GAC TCT TTC AAT ACC GGC AGT GGT GCA TTG GGC GGC GGT GTG 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACG CTG CAA GGC CGT GAT TTG CTG TTG GAC GAC AGG CAA 
Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGT GTG AGT AAC GAC CGC GTG GAT GCT GCT 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACC GAA AGC GCG GGC AAC 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Asn 
210 215 220 

CGC GGC TAT CCG GTA GAA GGT GCG GGT AAA GAA ACG AAT ATC CGC GGT 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Lys Glu Thr Asn He Arg Gly 
225 230 235 240 

TCC GCC CGC GGC ATC CCC GAT CCG TCC AAA CAC AAA TAC CAC AAC TTC 
Ser Ala Arg Gly He Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 
Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAT 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 
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AAC CTG ACC GCT TCT TCC TGG CGC GAA GCC GAT GAC GTA AAC AGA CGG 
Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arq Arq 
290 295 300 

CGC AAT GCC AAC CTC TTT TAC GAA TGG ATG COT GAT TCA AAT TGG TTG 
Arg Asn Ala Asn Leu Phe Tyr Glu Trp Met Pro Asp Ser Asn Trp Leu 
305 310 315 320 

TCG TCT TTG AAG GCG GAC TTC GAT TAT CAG AAA ACC AAA GTG GCG GCG 
Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Lys Thr Lys Val Ala Ala 
325 330 335 

ATT AAC AAA GGT TCG TTC CCG ACG AAT TAC ACC ACA TGG GAA ACT GAG 
He Asn Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr Glu 
340 345 350 

TAC CAT AAA AAG GAA GTT GGC GAA ATA TAC AAC CGC AGC ATG GAC ACC 
Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp Thr 
355 360 365 

CGA TTC AAA CGT TTT ACT TTG CGT TTG GAC AGC CAT CCG TTG CAA CTC 
Arg Phe Lys Arg Phe Thr Leu Arg Leu Asp Ser His Pro Leu Gin Leu 
370 375 380 

GGG GGG GGG CGA CAC CGC CTG TCG TTT AAA ACT TTC GCC AGC CGC CGT 
Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser Arg Arg 
385 390 395 400 

GAT TTT GAA AAC CTA AAC CGC GAC GAT TAT TAC TTC AGC GGC CGT GTT 
Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Arg Val 
405 410 415 

GTT CGA ACC ACC AGC AGT ATC CAG CAT CCG GTG AAA ACC ACC AAC TAC 
Val Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

GGT TTC TCA CTG TCT GAC CAA ATT CAA TGG AAC GAC GTG TTC AGT AGC 
Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

CGC GCA GGT ATC CGT TAC GAC CAC ACC AAA ATG ACG CCT CAG GAA TTG 
Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCA CCT GCA GCC AAC ACT 
Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

TAT AAA GGC TGG AGC GGT TTT GTC GGC TTG GCG GCG CAA CTG AAT CAG 
Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

GCT TGG CGT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC CCC AAT 
Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT TGG CTG 
Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 

CCC AAT CCC AAC CTG AAA GCC GAG CGC AGC ACC ACC CAC ACC CTG TCT 
Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

CTG CAA GGC CGC AGC GAA AAA GGC ATG CTG GAT GCC AAC CTG TAT CAA 
Leu Gin Gly Arg Ser Glu Lys Gly Met Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 
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AGC AAT TAC CGC AAT TTC CTG TCT GAA GAG CAG AAG CTG ACC ACC AGC 1728 
Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

GGC ACT CCC GGC TGT ACT GAG GAA AAT GCT TAC TAC AGT ATA TGC AGC 1776 
Gly Thr Pro Gly Cys Thr Glu Glu Asn Ala Tyr Tyr Ser He Cys Ser 
580 585 590 

GAC CCC TAC AAA GAA AAA CTG GAT TGG CAG ATG AAA AAT ATC GAC AAG 1824 
Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn He Asp Lys 
595 600 605 

GCC AGA ATC CGC GGT ATC GAG CTG ACA GGC CGT CTG AAT GTG GAC AAA 1872 
Ala Arg He Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val Asp Lys 
610 615 620 

GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA CTG TTC GGC TCG CTG GGT 1920 
Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Gly 
625 630 635 640 

TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA CAG 1968 
Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 

CCG CTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC GAA AAA 2 016 

Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 

TGG GGC GTA TTC TCC CGC CTG ACC TAT CTG GGC GCG AAA AAG GTC AAA 2064 
Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Val Lys 
675 680 685 

GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG CCT TTG 2112 
Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro Leu 
690 695 700 

CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT GTG 2160 
Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
■705 710 715 720 

TTC GAT ATG TAC GGC TTC TAC AAA CCG GTG AAA AAC CTG ACC CTG CGT 2208 
Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr Leu Arg 
725 730 735 

GCG GGC GTG TAC AAC CTG TTC AAC CGC AAA TAC ACC ACT TGG GAT TCC 2256 
Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

CTG CGC GGT TTA TAT AGC TAC AGC ACC ACC AAT GCG GTC GAC CGC GAT 2304 
Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
755 760 765 

GGC AAA GGC TTA GAT CGC TAC CGC GCC CCA GGC CGC AAT TAC GCC GTA 2352 
Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr Ala Val 
770 775 780 

TCG CTG GAA TGG AAG TTT TAA 2373 
Ser Leu Glu Trp Lys Phe * 
785 790 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Lys Pro Leu Gin Met Leu Pro lie Ala Ala Leu Val Gly Ser lie 
15 10 15 

Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu He Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Glv 
^5 70 75 80 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Glu He Val Lvs 
130 135 140 ^ 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
150 155 160 

Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 



Arg Gly Tyr Pro Val Glu Gly Ala Gly Lys Glu Thr Asn He Arg Gly 
225 230 235 240 

Ser Ala Arg Gly He Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 

Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arq Arg 
290 295 300 

Arg Asn Ala Asn Leu Phe Tyr Glu Trp Met Pro Asp Ser Asn Trp Leu 
305 310 315 320 
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Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Lys Thr Lys Val Ala Ala 
325 330 335 

He Asn Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr Glu 
340 345 350 

Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp Thr 
355 360 365 

Arg Phe Lys Arg Phe Thr Leu Arg Leu Asp Ser His Pro Leu Gin Leu 
370 375 380 

Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser Arg Arg 
385 390 395 400 

Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Arg Val 
405 410 415 

Val Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 



Leu Gin Gly Arg Ser Glu Lys Gly Met Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 

Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

Gly Thr Pro Gly Cys Thr Glu Glu Asn Ala Tyr Tyr Ser He Cys Ser 
580 585 590 

Asp Pro Tyr Lys Glu Lys Leu Asp Trp Qln Met Lys Asn He Asp Lys 
595 600 605 

Ala Arg He Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val Asp Lys 
610 615 620 

Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Gly 
625 630 635 640 

Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 

Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 
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Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Val Lys 
675 G80 685 

Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro Leu 
690 695 700 

Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
705 710 715 720 

Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr Leu Arg 
725 730 735 ^ 

Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
755 760 765 

Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr Ala Val 
770 775 780 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2375 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG AAA CCA TTA CAA ATG CCC CCT ATC GCC GCG CTG CTC GGC AGT ATT 
Met Lys Pro Leu Gin Met Pro Pro He Ala Ala Leu Leu Gly Ser He 
15 10 15 

TTC GGC AAT CCG GTC TTT GCG GCA GAT QAA GCT GCA ACT GAA ACC ACA 
Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG GTA AAA GCA GTG CGC GTT AAA GGT CAG CGC AAT 
Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAT AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 
Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
^5 70 75 80 

TTG AGC GAC AGG AGC CGT CAT CAA AAA GGC TTT GCC ATT CGC GGC GTG 
Leu Ser Asp Arg Ser Arg His Gin Lys Gly Phe Ala He Arg Gly Val 
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GAA GGC GAC CGT GTC GGC GTT AGT ATT GAC GGC GTA AAC CTG CCT GAT 
Glu Gly Asp Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAC ATC GTA AAA 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Asp He Val Lvs 
130 135 140 ' 

GGG GCG GAC TCT TTC AAT ACC GGC AGC GGC GCC TTG GGC GGC GGT GTG 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Glv Val 
150 155 ^ ^ y 

AAT TAC CAA ACC CTG CAA GGA CGT GAC TTA CTG TTG CCT GAA CGG CAG 
Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arq Glu Trc 
180 185 190 

ACA AAT ACC CTC GGT TTC GGC GTG AGC AAC GAC CGC GTG GAT GCC GCT 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACT GAA AGC GCG GGC AAG 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Glv Lvs 
210 215 220 

CGT GGT TAT CCG GTA GAG GGT GCT GGT AGC GGA GCG AAT ATC CGT GGT 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn He Arg Gly 
225 230 235 240 

TCT GCG CGC GGT ATT CCT GAT CCG TCC CAA CAC AAA TAC CAC AGC TTC 
Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 
Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAC 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tvr 
275 280 285 

AAC CTG CTT GCT TCT TAT TGG CGT GAA GCT GAC GAT GTC AAC AGA CGG 
Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGT AAC ACC AAC CTC TTT TAC GAA TGG ACG CCG GAA TCC GAC CGG TTG 
Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Ara Leu 
305 310 315 320 

TCT ATG GTA AAA GCG GAT GTC GAT TAT CAA AAA ACC AAA GTA TCT GCG 
Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

GTC AAC TAC AAA GGT TCG TTC CCG ACG AAT TAC ACC ACA TGG GAA ACC 
Val Asn Tyr Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr 
340 345 350 

GAG TAC CAT AAA AAG GAA GTT GGC GAA ATC TAT AAC CGC AGC ATG GAT 
Glu Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp 
355 360 365 
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^"^^ ATG GAC AGC CAT CCG TTG CAA 

Thr Thr Phe Lys Arg He Thr Leu Arg Met Asp Ser His Pro Leu Gin 
375 380 

CTC GGG GGG GGG CGA CAC CGC CTG TCG TTC AAA ACC TTT GCC GGG CAG 
Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Gly Gin 
390 

CGT GAT TTT GAA AAC TTA AAC CGC GAC GAT TAG TAC TTC AGC GGC CGT 
Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glv Ara 
405 410 ^ 

QTT GTT CGA ACC ACC AAC AGT ATC CAG CAT CCG GTG AAA ACC ACC AAC 
Val Val Arg Thr Thr Asn Ser He Gin His Pro Val Lys Thr Thr Asn 
420 425 430 



?vr ril ir r° ""^^ ^ GTG TTC AGT 

Tyr Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser 
435 440 

sir A^S t^"" r?"" tT"" ^'''^ ^ CCT CAG GAA 

Ser Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu 
4=0 455 460 

TTG AAT GCC GAC TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC AAC 
Leu Asn Ala Asp Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn 
'^■^O 475 480 

ACT TAT AAA GGC TGG AGC GGA TTT GTC GGC TTG GCG GCG CAG CTG AGC 
Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Ser 
485 490 

CAA ACA TGG CGT TTG GGT TAC GAT GTG ACC TCA GGT TTC CGC GTG CCG 

Gin Thr Trp Arg Leu Gly Tyr Asp Val Thr Ser Gly Phe Arq Val Pro 
500 505 510 

AAT GCG TCT GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGC ACT TGG 
Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Thr Trp 
515 520 525 

AAG CCT AAT CCT AAT TTG AAG GCA GAA CGC AGC ACC ACC CAC ACC CTG 
Lys Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu 
=30 535 54Q 

TCC TTG CAG GGG CGC GGC GAC AAA GGG ACA CTG GAT GCC AAC CTG TAT 
ser Leu Gin Gly Arg Gly Asp Lys Gly Thr Leu Asp Ala Asn Leu T^ 
550 555 5^0 

CAA AGC AAT TAC CGA AAC TTC CTG TCG GAA GAG CAG AAT CTG ACT GTC 
Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Asn Leu Thr Val 
565 570 575 

AGC GGC ACA CCC GGC TGT ACT GAG GAG GAT GCT TAC TAC TAT AGA TGC 
Ser Gly Thr Pro Gly Cys Thr Glu Glu Asp Ala Tyr Tyr Tyr Arg Cvs 
580 585 590 

AGC GAC CCC TAC AAA GAA AAA CTG GAT TGG CAG ATG AAA AAT ATC GAC 
Ser Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn He Asp 
595 600 605 

AAG GCC AGA ATC CGC GGT ATC GAG TTG ACA GGC CGT CTG AAT GTG GAC 
Lys Ala Arg He Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val Asp 
610 615 
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AAA GTA GCG TCT TTT GTT CCT GAG GGT TGG AAA CTG TTC GGC TCG CTG 
Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu 
"0 635 640 

GGT TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA 
Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr 
650 655 

CAG CCG CTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC GAA 
Gin Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser Glu 
660 665 670 



AAA TGG GGC GTA TTC TCC CGC CTG ACC TAT CTA GGC GCG AAA AAG GTC 
Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lvs Val 
675 680 685 

AAA GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG CCT 
Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro 
690 695 700 

TTG CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT 
Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr 
710 715 720 

GTG TTT GAT ATG TAC GGC TTC TAC AAA CCG GCT AAA AAC CTG ACT TTG 
Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Ala Lys Asn Leu Thr Leu 
725 730 

CGT GCA GGC GTG TAC AAC CTG TTC AAC CGC AAA TAC ACC ACT TGG GAT 
Arg Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp 
740 745 750 

TCC CTG CGC GGT TTA TAT AGC TAC AGC ACC ACC AAT GCG GTC GAC CGC 
Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg 
755 760 765 

GAT GGC AAA GGC TTA GAC CGC TAC CGC GCC CCA GGC CGC AAT TAC GCC 
Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr Ala 
770 775 -jQQ 

GTA TCG CTG GAA TGG AAG TTT TAA 
Val Ser Leu Glu Trp Lys Phe * 
785 790 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Pro Leu Gin Met Pro Pro He Ala Ala Leu Leu Gly Ser He 
is 10 15 

Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 



1920 
1968 
2016 

2064 
2112 
2160 
2208 

2256 

2304 

2352 

2375 
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Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Glv 
65 70 75 80 

Leu Ser Asp Arg Ser Arg His Gin Lys Gly Phe Ala He Arg Gly Val 
85 90 95 

Glu Gly Asp Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 



Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
ISO 185 190 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn He Arg Gly 
225 230 235 240 

Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 310 315 320 

Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

Val Asn Tyr Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr 
340 345 350 

Glu Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp 
355 360 365 

Thr Thr Phe Lys Arg He Thr Leu Arg Met Asp Ser His Pro Leu Gin 
370 375 380 

Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Gly Gin 
385 390 395 400 
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Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Arg 

Val Val Arg Thr Thr Asn Ser He Gin His Pro Val Lys Thr Thr Asn 

Tyr Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser 
435 

Ser Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu 

455 460 
Leu Asn Ala Asp Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn 

Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Ser 
485 490 495 

Gin Thr Trp Arg Leu Gly Tyr Asp Val Thr Ser Gly Phe Arg Val Pro 
" 505 

Asn Ala ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Thr Trp 
520 525 

Lys Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu 

Ser Leu Gin Gly Arg Gly Asp Lys Gly Thr Leu Asp Ala Asn Leu Tyr 

555 550 

Gin ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Asn Leu Thr Val 
565 570 

ser Gly Thr Pro Gly Cys Thr Glu Glu Asp Ala Tyr Tyr Tyr Arg Cys 

585 590 
Ser Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn He Asp 

600 605 
Lys Ala Arg He Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val Asp 

615 620 
Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu 
"° 635 640 

Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr 

645 650 655 

Gin Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser Glu 

660 665 670 

Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Val 

Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro 
695 700 

Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr 

■715 720 

val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Ala Lys Asn Leu Thr Leu 
725 730 735 

Arg Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp 
745 750 
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Ser Leu Arg Gly Leu Tyr Ser Tyr 
755 760 

Asp Gly Lys Gly Leu Asp Arg Tyr 
770 775 

Val Ser Leu Glu Trp Lys Phe 
785 790 



Ser Thr Thr Asn Ala Val Asp Arg 
765 

Arg Ala Pro Gly Arg Asn Tyr Ala 

780 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2379 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATG AAA CCA TTA CAA ATG CTC CCT ATC GCC GCG CTG GTC GGC AGT ATT 
Met Lys Pro Leu Gin Met Leu Pro He Ala Ala Leu Val Gly Ser He 
15 10 15 

TTC GGC AAT CCG GTC TTT GCG GCA GAT GAA GCT GCA ACT GAA ACC ACA 
Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG GTA AAA GCA GTG CGC GTT AAA GGC CAG CGC AAT 
Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAC AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 
Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
€5 70 75 80 

TTG AGC GAC AGC GGC CGC CAT CAA AAA GGC TTT GCT GTT CGC GGC GTG 
Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGC GTG AGC ATA GAC GGC GTA AAC CTG CCT GAT 
Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT CTG TOT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAC ATC GTA AAA 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Asp He Val Lys 
130 135 140 

GGG GCG GAC TCT TTC AAT ACC GGC AGC GGC GCC TTG GGC GGC GGT GTG 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 
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AAT TAG CAA ACC CTG CAA GGA CGT GAC TTA CTG TTG CCT GAA CGG CAG 52 8 

Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAG AGC ACG GGT AAC CGT GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGG GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACT GAA AGC GGG GGC AAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

CGT GGT TAT GGG GTA GAG GGT GCT GGT AGC GGA GCG AAT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn He Arg Gly 
225 230 235 240 

TCT GCG CGC GGT ATT CCT GAT CCG TCC CAA CAC AAA TAG GAC AGC TTC 768 
Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GGA 816 
Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAG ACG GTT GAA GAG TCT TAG 864 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

AAC CTG CTT GCT TCT TAT TGG CGT GAA GCT GAC GAT GTG AAG AGA GGG 912 
Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGT AAC ACG AAC CTG TTT TAG GAA TGG ACG CCG GAA TCC GAC CGG TTG 960 
Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 310 315 320 

TCT ATG GTA AAA GCG GAT GTG GAT TAT CAA AAA ACC AAA GTA TCT GCG 1008 
Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

GTC AAG TAC AAA GGT TCG TTC GGG ATA GAG GAT TCT TGG ACC TTG ACA 1056 
Val Asn Tyr Lys Gly Ser Phe Pro He Glu Asp Ser Ser Thr Leu Thr 
340 345 350 

CGT AAC TAC AAT CAA AAG GAC TTG GAT GAA ATC TAC AAC CGC AGT ATG 1104 
Arg Asn Tyr Asn Gin Lys Asp Leu Asp Glu He Tyr Asn Arg Ser Met 
355 360 365 

GAT ACC CGC TTC AAA CGC ATT ACC CTG CGT TTG GAC AGC CAT CCG TTG 1152 
Asp Thr Arg Phe Lys Arg He Thr Leu Arg Leu Asp Ser His Pro Leu 
370 375 380 

CAA CTC GGG GGG GGG GGA CAC CGC CTG TCG TTT AAA ACT TTC GCC AGC 12 00 

Gin Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser 
385 390 395 400 

CGC CGT GAT TTT GAA AAC GTA AAC CGC GAC GAT TAT TAC TTC AGC GGC 1248 
Arg Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly 
405 410 415 

CGT GTT GTT CGA ACC ACC AGC AGT ATC CAG CAT CCG GTG AAA ACC ACC 1296 
Arg Val Val Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr 
420 425 430 
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AAC TAC GGT TTC TCA CTG TCT GAC CAA ATT CAA TGG AAC GAC GTG TTC 
Asn Tyr Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe 
435 440 445 

AGT AGC CGC GCA GGT ATC CGT TAC GAT CAT ACC AAA ATG ACQ CCT CAG 
Ser Ser Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin 
450 455 460 

GAA TTG AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC 
Glu Leu Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala 
465 470 475 480 

AAC ACT TAT AAA GGC TGG AGC GGT TTT GTC GGC TTG GCG GCG CAA CTG 
Asn Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu 
485 490 495 

AAT CAG GCT TGG CGT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC 
Asn Gin Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val 
500 505 510 

CCC AAT GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT 
Pro Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Glv Asn 
515 520 525 

TGG CTG CCC AAT CCC AAC CTG AAA GCC GAG CGC ACG ACC ACC CAC ACC 
Trp Leu Pro Asn Pro Asn Leu Lys Ala Glu Arg Thr Thr Thr His Thr 
530 535 540 

CTC TCT CTG CAA GGC CGC AGC GAA AAA GGT ACT TTG GAT GCC AAC CTG 
Leu Ser Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu 
545 550 555 560 

TAT CAA AGC AAT TAC CGC AAT TTC CTG TCT GAA GAG CAG AAG CTG ACC 
Tyr Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr 
565 570 575 

ACC AGC GGC GAT GTC AGC TGT ACT CAG ATG AAT TAC TAC TAC GGT ATG 
Thr Ser Gly Asp Val Ser Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met 
580 585 590 

TGT AGC AAT CCT TAT TCC GAA AAA CTG GAA TGG CAG ATG CAA AAT ATC 
Cys Ser Asn Pro Tyr Ser Glu Lys Leu Glu Trp Gin Met Gin Asn He 
595 600 605 

GAC AAG GCC AGA ATC CGC GGT ATC GAG CTG ACG GGC CGT CTG AAT GTG 
Asp Lys Ala Arg He Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val 
eiO 615 620 

GAC AAA GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA CTG TTC GGC TCG 
Asp Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Glv Ser 
625 630 635 640 

CTG GGT TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC 
Leu Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser 
645 650 655 

ACC CAG CCG TTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC 
Thr Gin Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser 
660 665 670 

GAA AAA TGG GGC GTG TTC TCC CGC CTG ACC TAT CTG GGC GCG AAA AAG 
Glu Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys 
675 680 685 

GTC AAA GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG 
Val Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr 
690 695 700 
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CCT TTG CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT 2160 
Pro Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala 
705 710 715 720 

TAT GTG TTC GAT ATG TAC GGC TTC TAC AAA CCG GTG AAA AAC CTG ACT 220 8 

Tyr Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr 
725 730 735 

TTG CGT GCA GGC GTA TAT AAT GTG TTC AAC CGC AAA TAC ACC ACT TGG 2256 
Leu Arg Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp 
740 745 750 

GAT TCC CTG CGC GGC CTG TAT AGC TAC AGC ACC ACC AAC TCG GTC GAC 23 04 

Asp Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ser Val Asp 
755 760 765 

CGC GAT GGC AAA GGC TTA GAC CGC TAC CGC GCC CCA AGC CGT AAT TAC 2352 
Arg Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Ser Arg Asn Tyr 
770 775 780 

GCC GTA TCG CTG GAA TGG AAG TTT TAA 2 3 79 

Ala Val Ser Leu Glu Trp Lys Phe * 
785 790 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 92 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 6: 

Met Lys Pro Leu Gin Met Leu Pro He Ala Ala Leu Val Gly Ser He 
15 10 15 

Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 SO 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 



Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Asp He Val Lys 
130 135 140 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 
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Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn He Arg Gly 
230 235 240 

Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
310 315 ^ 

Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

Val Asn Tyr Lys Gly Ser Phe Pro He Glu Asp Ser Ser Thr Leu Thr 
340 345 350 

Arg Asn Tyr Asn Gin Lys Asp Leu Asp Glu He Tyr Asn Arg Ser Met 
355 360 365 

Asp Thr Arg Phe Lys Arg He Thr Leu Arg Leu Asp Ser His Pro Leu 



Gin Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser 
390 395 400 

Arg Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glv 
405 410 415 

Arg Val Val Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr 
420 425 430 

Asn Tyr Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe 
435 440 445 

Ser Ser Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin 
450 455 

Glu Leu Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala 
470 475 480 

Asn Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu 
485 490 

Asn Gin Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Arc Val 
500 505 510 
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Thr Ser Gly Asp Val Ser Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met 
580 585 590 



Val Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr 
690 695 700 



(2) INFORKATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: cDNA 
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(B) LOCATION: 1. .2373 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATG AAA CCA TTA CAC ATG CTT CCT ATT GCC GCG CTG GTC GGC AGT ATT 4 8 

Met Lys Pro Leu His Met Leu Pro He Ala Ala Leu Val Gly Ser He 
15 10 15 

TTC GGC AAT CCG GTC TTG GCA GCG GAT GAA GCT GCA ACC GAA ACC ACA 96 
Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAA GCA GAG ATA AAA GAA GTG CGC GTT AAA GAC CAG CTT AAT 144 
Pro Val Lys Ala Glu He Lys Glu Val Arg Val Lys Asp Gin Leu Asn 
35 40 45 

GCG CCT GCA ACC GTG GAA CGT GTC AAC CTC GGC CGC ATT CAA CAG GAA 192 
Ala Pro Ala Thr Val Glu Arg Val Asn Leu Gly Arg He Gin Gin Glu 
50 55 60 

ATG ATA CGC GAC AAC AAA GAC TTG GTG CGT TAC TCC ACC GAC GTC GGC 240 
Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAT AGC GGC CGC CAT CAA AAA GGC TTT GCT GTG CGC GGC GTG 288 
Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGT GTC AGC ATT GAC GGC GTG AGC CTG CCT GAT 3 36 

Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Ser Leu Pro Asp 
100 105 110 

TCG GAA GAA AAC TCA CTG TAT GCA CGT TAT GGC AAC TTC AAC AGC TCG 3 84 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGC CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAA ATC GCG AAG 432 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Glu He Ala Lys 
130 135 140 

GGC GCT GAC TCT TTC AAT ACC GGT AGC GGC GCA TTG GGT GGC GGC GTG 480 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACC CTG CAA GGA CAT GAT TTG CTG TTG GAC GAC AGG CAA 52 8 

Asn Tyr Gin Thr Leu Gin Gly His Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC AGC CGC AAC CGC GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Ser Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACA CTC GGT TTC GGT GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGT CGC GGT CAT GAG ACC GAA AGC GCG GGC GAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Glu 
210 215 220 

CGT GGC TAT CCG GTA GAG GGT GCT GGC AGC GGA GCA ATT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala He He Arg Gly 
225 230 235 240 

TCG TCA CGC GGT ATC CCT GAT CCG TCC AAA CAC AAA TAC CAC AAC TTC 768 
Ser Ser Arg Gly He Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 
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TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAG CAC CGC ATC GGC CCA 816 
Leu Gly Lys lie Ala Tyr Gin He Asn Asp Lys His Arg He Gly Pro 
260 265 270 

TCG TTT AAC GGC CAG CAG GGG CAT AAT TAC ACG ATT GAA GAG TCT TAT 864 
Ser Phe Asn Gly Gin Gin Gly His Asn Tyr Thr He Glu Glu Ser Tyr 
275 280 285 

AAC CTG ACC GCT TCT TCC TGG CGC GAA GCC GAT GAC GTA AAC AGA CGG 912 
Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGC AAT GCC AAC CTC TTT TAC GAA TGG ACG CCT GAT TCA AAT TGG CTG 960 
Arg Asn Ala Asn Leu Phe Tyr Glu Trp Thr Pro Asp Ser Asn Trp Leu 
305 310 315 320 

TCG TCT TTG AAG GCG GAC TTC GAT TAT CAG ACA ACC AAA GTG GCG GCG 1008 
Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Thr Thr Lys Val Ala Ala 
325 330 335 

GTT AAC AAC AAA GGC TCG TTC CCG ACG GAT TAT TCC ACC TGG ACG CGC 1056 
Val Asn Asn Lys Gly Ser Phe Pro Thr Asp Tyr Ser Thr Trp Thr Arg 
340 345 350 

AAC TAT AAT CAG AAG GAT TTG GAG AAT ATA TAC AAC CGC AGC ATG GAC 1104 
Asn Tyr Asn Gin Lys Asp Leu Glu Asn He Tyr Asn Arg Ser Met Asp 
355 360 365 

ACC CGA TTC AAA CGT TTT ACT TTG CGT ATG GAC AGC CAA CCG TTG CAA 1152 
Thr Arg Phe Lys Arg Phe Thr Leu Arg Met Asp Ser Gin Pro Leu Gin 
370 375 380 

CTG GGC GGC CAA CAT CGC TTG TCG CTT AAA ACT TTC GCC AGT CGG CGT 1200 
Leu Gly Gly Gin His Arg Leu Ser Leu Lys Thr Phe Ala Ser Arg Arg 
385 390 395 400 

GAG TTT GAA AAC TTA AAC CGC GAC GAT TAT TAC TTC AGC GAA AGA GTA 1248 
Glu Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glu Arg Val 
405 410 415 

TCC CGT ACT ACC AGC TCG ATT CAA CAC CCC GTG AAA ACC ACT AAT TAT 1296 
Ser Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

GGT TTC TCA CTG TCT GAT CAA ATC CAA TGG AAC GAC GTG TTC AGC AGC 1344 
Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

CGT GCA GAT ATC CGT TAC GAT CAT ACC AAA ATG ACG CCT CAG GAA TTG 1392 
Arg Ala Asp He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC AAT ACT 144 0 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

TAT AAA GGC TGG AGC GGA TTT GTC GGT TTG GCG GCG CAA CTG AAT CAG 1488 
Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

GCT TGG CAT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC CCC AAT 1536 
Ala Trp His Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT TGG CTG 1584 
Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 
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CCC AAT CCC AAC CTG AAA GCC GAG CGC AGC ACC ACC CAC ACC CTG TCT 
Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

CTG CAA GGC CGC AGC GAA AAA GGT ACT TTG GAT GCC AAC CTG TAT CAA 
Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 

AAC AAT TAC CGC AAC TTC TTG TCT GAA GAG CAG AAG CTG ACC ACC AGC 
Asn Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

GGC GAT GTC GGC TGT ACT CAG ATG AAT TAC TAC TAC GGT ATG TGT AGC 
Gly Asp Val Gly Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met Cys Ser 
580 585 590 

AAT CCT TAT TCC GAA AAA CCG GAA TGG CAG ATG CAA AAT ATC GAT AAG 
Asn Pro Tyr Ser Glu Lys Pro Glu Trp Gin Met Gin Asn He Asp Lys 
595 600 605 

GCC CGA ATC CGT GGT CTT GAG CTG ACA GGC CGT CTG AAT GTG ACA AAA 
Ala Arg He Arg Gly Leu Glu Leu Thr Gly Arg Leu Asn Val Thr Lys 
610 615 620 

GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA TTG TTC GGC TCG CTG GGT 
Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Glv 
625 630 635 640 

TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA CAG 
Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 

CCG CCG AAA GTG ATT GCC GGT GTC GAC TAC GAA AGC CCG AGC GAA AAA 
Pro Pro Lys Val He Ala Gly Val Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 

TGG GGT GTG TTC TCC CGC CTG ACT TAT CTG GGT GCG AAA AAG GCC AAA 
Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Ala Lys 
675 680 685 

GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC CGG GGT ACG CCT TTG 
Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Arg Gly Thr Pro Leu 
690 695 700 

CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT GTG 
Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
705 710 715 720 

TTT GAT ATG TAC GGC TTC TAC AAA CTG GCT AAA AAC CTG ACT TTG CGT 
Phe Asp Met Tyr Gly Phe Tyr Lys Leu Ala Lys Asn Leu Thr Leu Arg 
725 730 735 

GCA GGC GTA TAT AAT GTG TTC AAC CGC AAA TAC ACC ACT TGG GAT TCC 
Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

CTG CGC GGT TTG TAT AGC TAC AGC ACC ACC AAC GCG GTC GAC CGA GAT 
Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Aan Ala Val Asp Arg Asp 
755 760 765 

GGC AAA GGC TTA GAC CGC TAC CGC GCC TCA GGC CGT AAT TAC GCC GTA 
Gly Lys Gly Leu Asp Arg Tyr Arg Ala Ser Gly Arg Asn Tyr Ala Val 
770 775 780 

TCG CTG GAT TGG AAG TTT TGA ATTCC 
Ser Leu Asp Trp Lys Phe * 
785 790 
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(2) INFORMATION FOR SEQ ID NO:B: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Lys Pro Leu His Met Leu Pro lie Ala Ala Leu Val Gly Ser He 
15 10 15 

Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

Pro Val Lys Ala Glu He Lys Glu Val Arg Val Lys Asp Gin Leu Asn 
35 40 45 

Ala Pro Ala Thr Val Glu Arg Val Asn Leu Gly Arg He Gin Gin Glu 
50 55 60 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Ser Leu Pro Asp 
100 105 110 

Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Glu He Ala Lys 
130 135 140 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

Asn Tyr Gin Thr Leu Gin Gly His Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

Phe Gly Val Met Met Lys Asn Gly Tyr Ser Ser Arg Asn Arg Glu Trp 
180 185 190 

Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Glu 
210 215 220 

Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala He He Arg Gly 
225 230 235 240 

Ser Ser Arg Gly He Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 

Leu Gly Lys He Ala Tyr Gin He Asn Asp Lys His Arg He Gly Pro 
260 265 270 

Ser Phe Asn Gly Gin Gin Gly His Asn Tyr Thr He Glu Glu Ser Tyr 
275 280 285 

Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 
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Arg Asn Ala Asn Leu Phe Tyr Glu Trp Thr Pro Asp Ser Asn Trp Leu 
310 

Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Thr Thr Lys Val Ala Ala 
325 330 

Val Asn Asn Lys Gly Ser Phe Pro Thr Asp Tyr Ser Thr Trp Thr Ara 
340 345 350 ^ 

Asn Tyr Asn Gin Lys Asp Leu Glu Asn He Tyr Asn Arg Ser Met Asp 
355 360 365 

Thr Arg Phe Lys Arg Phe Thr Leu Arg Met Asp Ser Gin Pro Leu Gin 
370 375 

Leu Gly Gly Gin His Arg Leu Ser Leu Lys Thr Phe Ala Ser Arg Arq 
390 395 400 

Glu Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glu Arg Val 
405 410 4i| 

Ser Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

Arg Ala Asp He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 

450 455 

Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
-i^O 475 

Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

Ala Trp His Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 

Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 

Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 54Q 

Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu Tyr Gin 
550 555 ^ 

Asn Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

Gly Asp Val Gly Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met Cys Ser 
580 585 590 

Asn Pro Tyr Ser Glu Lys Pro Glu Trp Gin Met Gin Asn He Asp Lys 
595 600 605 

Ala Arg He Arg Gly Leu Glu Leu Thr Gly Arg Leu Asn Val Thr Lvs 
610 615 620 

Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Gly 
"5 630 635 640 

Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 



- 68 - 



wo 96/12020 



PCTAJS95/13623 



Pro Pro Lys Val He Ala Gly Val Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 

Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Ala Lys 
675 580 685 

Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Arg Gly Thr Pro Leu 
690 695 700 

Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
705 710 715 720 

Phe Asp Met Tyr Gly Phe Tyr Lys Leu Ala Lys Asn Leu Thr Leu Arq 
725 730 735 

Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
755 760 765 

Gly Lys Gly Leu Asp Arg Tyr Arg Ala Ser Gly Arg Asn Tyr Ala Val 
770 775 780 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Gin Gin Gin His Leu Phe Arg Leu Asn He Leu Cys Leu Ser Leu 
^5 10 15 

Met Thr Ala Leu Pro Val Tyr Ala Glu Asn Val Gin Ala Glu Gin Ala 
20 25 30 

Gin Glu Lys Gin Leu Asp Thr He Val Lys Ala Lys Lys Gin Lys Thr 
35 40 45 

Arg Arg Asp Asn Glu Val Thr Gly Leu Gly Lys Leu Val Lys Ser Ser 
50 55 60 

Asp Thr Leu Ser Lys Glu Gin Val Leu Asn He Arg Asp Leu Thr Arg 
S5 70 75 80 

Tyr Asp Pro Gly He Ala Val Val Glu Gin Gly Arg Gly Ala Ser Ser 
85 90 95 

Gly Tyr Ser He Arg Gly Met Asp Lys Asn Arg Val Ser Leu Thr Val 
100 105 110 

Asp Gly Val Ser Gin He Gin Ser Tyr Thr Ala Gin Ala Ala Leu Glv 
115 120 125 

Gly Thr Arg Thr Ala Gly Ser Ser Gly Ala He Asn Glu He Glu Tyr 
130 135 140 
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Glu ASH val Lys Ala Val Glu He Ser Lys Gly Ser Asn Ser Ser Glu 
150 155 

Tyr Gly Asn Gly Ala Leu Ala Gly Ser Val Ala Phe Gin Thr Lys Thr 
165 170 

Ala Ala Asp He He Gly Glu Gly Lys Gin Trp Gly He Gin Ser Lys 
180 185 190 

Thr Ala Tyr Ser Gly Lys Asp His Ala Leu Thr Gin Ser Leu Ala Leu 



Ala Gly Arg Ser Gly Gly Ala Glu Ala Leu Leu He Tyr Thr Lvs Ara 
210 215 220 J- a 

Arg Gly Arg Glu He His Ala His Lys Asp Ala Gly Lys Gly Val Gin 
225 230 235 240 

Ser Phe Asn Arg Leu Pro He Cys Arg Phe Gly Asn Asn Thr Tyr Thr 
245 250 255 

Asp Cys Thr Pro Arg Asn He Gly Gly Asn Gly Tyr Tyr Ala Ala Val 
260 265 270 

Gin Asp Asn Val Arg Leu Gly Arg Trp Ala Asp Val Gly Ala Glv He 
275 280 285 

Arg Tyr Asp Tyr Arg Ser Thr His Ser Glu Asp Lys Ser Val Ser Thr 
290 295 300 

Gly Thr His Arg Asn Leu Ser Trp Asn Ala Gly Val Val Leu Lys Pro 
310 315 

Phe Thr Trp Met Asp Leu Thr Tyr Arg Ala Ser Thr Gly Phe Arg Leu 
325 330 335 

Pro Ser Phe Ala Glu Met Tyr Gly Trp Arg Ala Gly Glu Ser Leu Lvs 
340 345 350 

Thr Leu Asp Leu Lys Pro Glu Lys Ser Phe Asn Arg Glu Ala Glv He 
355 360 365 

Val Phe Lys Gly Asp Phe Gly Asn Leu Glu Ala Ser Tyr Phe Asn Asn 
370 375 380 

Ala Tyr Arg Asp Leu He Ala Phe Gly Tyr Glu Thr Arg Thr Gin Asn 
385 390 395 400 

Gly Gin Thr Ser Ala Ser Gly Asp Pro Gly Tyr Arg Asn Ala Gin Asn 
405 410 

Ala Arg He Ala Gly He Asn He Leu Gly Lys He Asp Trp His Glv 
420 425 430 

Val Trp Gly Gly Leu Pro Asp Gly Leu Tyr Ser Thr Leu Ala Tyr Asn 
435 440 445 

Arg He Lys Val Lys Asp Ala Asp Arg Ala Asp Arg Thr Phe Val Thr 
450 455 460 

Ser Tyr Leu Phe Asp Ala Val Gin Pro Ser Arg Tyr Val Leu Gly Leu 
465 470 475 480 

Gly Tyr Asp His Pro Asp Gly He Trp Gly He Asn Thr Met Phe Thr 
485 490 495 

Tyr Ser Lys Ala Lys Ser Val Asp Glu Leu Leu Gly Ser Gin Ala Leu 
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Leu Asn Gly Asn Ala Asn Ala Lys Lys Ala Ala Ser Arg Arg Thr Arg 
515 520 525 



Leu Thr Leu Arg Ala Gly Val Tyr Asn Leu Leu Asn Tyr Arg Tyr Val 
545 550 555 560 

Thr Trp Glu Asn Val Arg Gin Thr Ala Gly Gly Ala Val Asn Gin His 
565 570 575 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 07 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECITLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Asn Lys Lys His Gly Phe Gin Leu Thr Leu Thr Ala Leu Ala Val 
15 10 15 

Ala Ala Ala Phe Pro Ser Tyr Ala Ala Asn Pro Glu Thr Ala Ala Pro 
20 25 30 

Asp Ala Ala Gin Thr Gin Ser Leu Lys Glu Val Thr Val Arg Ala Ala 
35 40 45 

Lys Val Gly Arg Arg Ser Lys Glu Ala Thr Gly Leu Gly Lys He Ala 
50 55 60 

Lys Thr Ser Glu Thr Leu Asn Lys Glu Gin Val Leu Gly He Arg Asp 
65 70 75 80 

Leu Thr Arg Tyr Asp Pro Gly Val Ala Val Val Glu Gin Gly Asn Gly 
85 90 95 

Ala Ser Gly Gly Tyr Ser He Arg Gly Val Asp Lys Asn Arg Val Ala 
100 105 110 



He Glu Tyr Glu Asn He Ser Thr Val Glu He Asp Lys Gly Ala Gly 
145 150 155 160 

Ser Ser Asp His Gly Ser Gly Ala Leu Gly Gly Ala Val Ala Phe Arg 
165 170 175 
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Thr Lys Glu Ala Ala Asp Leu He Ser Asp Gly Lys Ser Trp Glv He 
180 185 190 

Gin Ala Lys Thr Ala Tyr Gly Ser Lys Asn Arg Gin Phe Met Lvs Ser 
195 200 205 

Leu Gly Ala Gly Phe Ser Lys Asp Gly Trp Glu Gly Leu Leu He Arg 
210 215 220 

Thr Glu Arg Gin Gly Arg Glu Thr His Pro His Gly Asp He Ala Asp 
225 230 235 240 

Gly Val Ala Tyr Gly He Asn Arg Leu Ser Val Cys Gly Tyr He Glu 
245 250 255 

Thr Leu Arg Ser Arg Lys Cys Val Pro Arg Lys He Asn Gly Ser Asn 
260 265 270 

He His He Ser Leu Asn Asp Arg Phe Ser He Gly Lys Tyr Phe Asp 
275 280 285 

Phe Ser Leu Gly Gly Arg Tyr Asp Arg Lys Asn Phe Thr Thr Ser Glu 
290 295 300 

Glu Leu Val Arg Ser Gly Arg Tyr Val Asp Arg Ser Trp Asn Ser Glv 
305 310 315 

He Val Phe Lys Pro Asn Arg His Phe Ser Leu Ser Tyr Arg Ala Ser 
325 330 335 

Ser Gly Phe Arg Thr Pro Ser Phe Gin Glu Leu Phe Gly He Asp He 
340 345 350 

Tyr His Asp Tyr Pro Lys Gly Trp Gin Arg Pro Ala Leu Lys Ser Glu 



Lys Ala Ala Asn Arg Glu He Gly Leu Gin Trp Lys Gly Asp Phe Gly 
370 375 380 

Phe Leu Glu He Ser Ser Phe Arg Asn Arg Tyr Thr Asp Met He Ala 
385 390 395 

Val Ala Asp His Lys Thr Lys Leu Pro Asn Gin Ala Gly Gin Leu Thr 
405 410 

Glu He Asp He Arg Asp Tyr Tyr Asn Ala Gin Asn Met Ser Leu Gin 
420 425 430 

Gly Val Asn He Leu Gly Lys He Asp Trp Asn Gly Val Tyr Gly Lys 
435 440 445 

Leu Pro Glu Gly Leu Tyr Thr Thr Leu Ala Tyr Asn Arg He Lys Pro 
450 455 460 

Lys Ser Val Ser Asn Arg Pro Gly Leu Ser Leu Arg Ser Tyr Ala Leu 
465 470 475 480 

Asp Ala Val Gin Pro Ser Arg Tyr Val Leu Gly Phe Gly Tyr Asp Gin 
485 490 495 

Pro Glu Gly Lys Trp Gly Ala Asn He Met Leu Thr Tyr Ser Lys Gly 
500 505 510 

Lys Asn Pro Asp Glu Leu Ala Tyr Leu Ala Gly Asp Gin Lys Arg Tyr 
515 520 525 
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lie Gly Asn Tyr Arg Tyr Val Thr Trp Glu Ser Leu Arg Gin Thr Ala 
565 570 575 

Glu Ser Thr Ala Asn Arg His Gly Gly Asp Ser Asn Tyr Gly Arg Tyr 
580 585 590 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AAACAGGTCT CGGCATAG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGCGAATTCA AACAGGTCTC GGCATAG 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
CGCGAATTCA AAAACTTCCA TTCCAGCGAT ACQ 



(2) INFORMATION FOR SEQ ID NO: 14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
TAAAACTTCC ATTCCAGCGA TACG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 
AAACAGGTCT CGGCATAG 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGCGAATTCA AACAGGTCTC GGCATAG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CGCGAATTCA AAAACTTCCA TTCCAGCGAT ACG 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TAAAACTTCC ATTCCAGCGA TACG 
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WHAT WE CLAIM IS: 

1. An isolated and purified recombinant nucleic acid encoding a 
hemoglobin receptor protein from a Neisseria species. 

2. An isolated and purified recombinant nucleic acid according to Claim 
1, wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 
acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 2. 

3 . An isolated and purified recombinant nucleic acid according to Claim 
1, wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 
acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 4. 

4. An isolated and purified recombinant nucleic acid according to Claim 
1, wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 
acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 6. 

5 . An isolated and purified recombinant nucleic acid according to Claim 
1, wherein the nucleic acid encodes a hemoglobin receptor protein having an amino 
acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 8. 

6. A homogeneous preparation of a hemoglobin receptor protein from a 
Neisseria species. 

7. The hemoglobin receptor protein of Claim 6, wherein the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 
2. 

8. The hemoglobin receptor protein of Claim 6, wherein the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 
4. 

9. The hemoglobin receptor protein of Claim 6, wherem the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 
6. 

10. The hemoglobin receptor protem of Claim 6, wherein the protein has 
an amino acid sequence that is the amino acid sequence depicted as Seq. I.D. No. 
8. 

11. A recombinant expression construct comprising a nucleic acid that 
encodes a hemoglobin receptor protein from a Neisseria species. 
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12. A transformed cell culture comprising the recombinant expression 
construct of Claim 1 1 . 

13. A recombinant expression construct according to Claim 11, wherein 
the nucleic acid encodes a hemoglobin receptor protein having an amino acid 

5 sequence that is the amino acid sequence depicted as Seq. I.D. No. 2. 

14. A recombinant expression construct according to Claim 11, wherein 
the nucleic acid encodes a hemoglobin receptor protein having an amino acid 
sequence that is the amino acid sequence depicted as Seq. I.D. No. 4. 

15. A recombinant expression construct according to Claim 11, wherein 
10 the nucleic acid encodes a hemoglobin receptor protein having an amino acid 

sequence that is the amino acid sequence depicted as Seq. I.D. No. 6. 

16. A recombinant expression construct according to Claim 11, wherein 
the nucleic acid encodes a hemoglobin receptor protein having an amino acid 
sequence that is the amino acid sequence depicted as Seq. I.D. No. 8. 

15 17. A transformed cell culmre comprising the recombinant expression 

construct of Claims 13, 14, 15 or 16. 

18. An antibody or antigen-binding fragment thereof that is 

immunologically reactive with an antigenic epitope of a hemoglobin receptor protein 

from a Neisseria species. 
20 19. An antibody according to Claim 18 that is a monoclonal antibody. 

20. An antibody or antigen-binding fragment thereof according to Claim 
18 that is immunologically reactive with an antigenic epitope of the hemoglobin 
receptor protein depicted as Seq. I.D. No. 2. 

21. An antibody or antigen-binding fragment thereof according to Claim 
25 18 that is inmiunologically reactive with an antigenic epitope of the hemoglobin 

receptor protein depicted as Seq. I.D. No. 4. 

22. An antibody or antigen-binding fragment thereof according to Claim 
18 that is immunologically reactive with an antigenic epitope of the hemoglobin 
receptor protein depicted as Seq. I.D. No. 6. 

30 23. An antibody or antigen-binding fragment thereof according to Claim 

18 that is immimologically reactive with an antigenic epitope of the hemoglobin 
receptor protein depicted as Seq. I.D. No. 8. 
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24. An antigenic epitope of a hemoglobin receptor protein from a 
Neisseria species. 

25. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 2. 

26. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 4. 

27. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 6. 

28. The antigenic epitope of Claim 24 wherein the hemoglobin receptor 
protein is the protein depicted as Seq. I.D. No. 8. 

29. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising an antibody according to Claims 18, 20, 21, 22, or 23. 

30. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising an antibody according to Claim 19. 

31. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising the nucleic acid of Claim 1. 

32. A diagnostic reagent for diagnosing a disease state in a human, 
wherein the disease state is caused by bacteria of a Neisseria species, the diagnostic 
reagent comprising the nucleic acid of Claims 2, 3, 4 or 5. 

33. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising an antibody according to Claim 18, 20, 21, 22, or 23. 

34. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising an antibody according to Claim 19. 

35. A therapeutic agent for treating a disease state in a human, wherein 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising the nucleic acid of Claim 1 or antisense homologue thereof. 
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36. A therapeutic agent for treating a disease state in a human, wherem 
the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 
comprising the nucleic acid of Claims 2, 3, 4, or 5 or antisense homologue thereof. 

37. A therapeutic agent for treating a disease state in a human, wherein 
5 the disease state is caused by bacteria of a Neisseria species, the therapeutic agent 

comprising the recombinant expression construct of Claims 11, 13, 14, 15 or 16 or 
a homologue thereof that expresses the nucleic acid encoding a hemoglobin receptor 
in an antisense orientation. 

38. An antibody according to Claims 20, 21 , 22 or 23 that is a monoclonal 
10 antibody. 

39. A cell line that produces the monoclonal antibody of Claims 19 or 38. 

40. A method of treating a disease in a human caused by bacteria of a 
Neisseria species, the method comprising the step of administering a therapeutically- 
effective amount of the therapeutic agent of Claims 33, 34, 35, 36, or 37 in a 

15 pharmaceutically-acceptable carrier. 

41. A method of diagnosing a disease in a human caused by bacteria of 
a Neisseria species, the method comprismg the steps of contacting an amount of a 
detectably-Iabeled diagnostic reagent of Claims 29, 30, 31, or 32 to a biological 
sample from the himian under conditions wherein the diagnostic reagent specifically 

20 binds to the Neisseria bacteria and detecting an amount of the specific binding to the 
biological sample. 

42. A vaccine that is effective in providing immunization against infection 
of a human with a bacteria of Neisseria species comprising a hemoglobin binding 
protein or antigenic fragment thereof. 

25 43. The vaccine of Claim 42 comprising the hemoglobin receptor protein 

of Claims 6, 7, 8, 9, or 10. 

44. The vaccine of Claim 42 comprising a nucleic acid encoding a 
hemoglobin receptor protein from a Neisseria species or antigenic fragment thereof. 

45. A vaccine according to Claim 44 comprising the nucleic acid of 
30 Claims 2, 3, 4, 5, 11, 13, 14, 15, or 16. 

46. The vaccine of Claim 42 comprising cells of the transformed cell 
culture of Claim 17. 
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47. A vaccine according to Claim 46 wherein the cells are attenuated 
bacterial cells. 

48. A vaccine according to Claim 47 wherein the cells are Salmonella 

cells. 

5 49. The vaccine of Claim 42 comprising the epitope of the hemoglobin 

receptor protein of Claims 24, 25, 26, 27 or 28. 
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Figure 7 

ATG AAA CCA TTA CAA ATG CCC CCT ATC GCC GCG CTG CTC GGC AGT ATT 4 8 

Met Lys Pro Leu Gin Met Pro Pro lie Ala Ala Leu Leu Gly Ser lie 
15 10 15 

TTC GGC AAT CCG GTC TTT GCG GCA GAT GAA GCT GCA ACT GAA ACC ACA 96 
Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG GTA AAA GCA GTG CGC GTT TiAA GGT CAG CGC AAT 144 
Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 192 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg lie Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAT AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 24 0 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp val Gly 
65 70 75 80 

TTG AGC GAC AGG AGC CGT CAT CAA AAA GGC TTT GCC ATT CGC GGC GTG 288 
Leu Ser Asp Arg Ser Arg His Gin Lys Gly Phe Ala He Arg Gly Val 
85 90 95 

GAA GGC GAC CGT GTC GGC GTT AGT ATT GAC GGC GTA AAC CTG CCT GAT 336 
Glu Gly Asp Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 384 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAC ATC GTA AAA 432 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Asp He Val Lys 
130 135 140 

GGG GCG GAC TCT TTC AAT ACC GGC AGC GGC GCC TTG GGC GGC GGT GTG 48 0 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACC CTG CAA GGA CGT GAC TTA CTG TTG CCT GAA CGG CAG 528 
Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGC GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACT GAA AGC GCG GGC AAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

CGT GGT TAT CCG GTA GAG GGT GCT GGT AGC GGA GCG AAT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn He Arg Gly 
225 230 235 240 

TCT GCG CGC GGT ATT CCT GAT CCG TCC CAA CAC AAA TAC CAC AGC TTC 768 
Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 816 
Leu Gly Lys He Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 
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Figure 7 (cont'd.) 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAC 864 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

AAC CTG CTT GCT TCT TAT TGG CGT GAA GCT GAC GAT GTC AAC AGA CGG 912 
Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGT AAC ACC AAC CTC TTT TAC GAA TGG ACG CCG GAA TCC GAC CGG TTG 960 
Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 310 315 320 

TCT ATG GTA AAA GCG GAT GTC GAT TAT CAA AAA ACC AAA GTA TCT GCG 1008 
Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

GTC AAC TAC AAA GGT TCG TTC CCG ACG AAT TAC ACC ACA TGG GAA ACC 1056 
Val Asn Tyr Lys Gly Ser Phe Pro Thr Asn Tyr Thr Thr Trp Glu Thr 
340 345 350 

GAG TAC CAT AAA AAG GAA GTT GGC GAA ATC TAT AAC CGC AGC ATG GAT 1104 
Glu Tyr His Lys Lys Glu Val Gly Glu He Tyr Asn Arg Ser Met Asp 
355 360 365 

ACA ACC TTC AAA CGT ATT ACG CTG CGT ATG GAC AGC CAT CCG TTG CAA 1152 
Thr Thr Phe Lys Arg He Thr Leu Arg Met Asp Ser His Pro Leu Gin 
370 375 380 

CTC GGG GGG GGG CGA CAC CGC CTG TCG TTC AAA ACC TTT GCC GGG CAG 12 00 

Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Gly Gin 
385 390 395 400 

CGT GAT TTT GAA AAC TTA AAC CGC GAC GAT TAC TAC TTC AGC GGC CGT 124 8 

Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly Arg 
405 410 415 

GTT GTT CGA ACC ACC AAC AGT ATC CAG CAT CCG GTG AAA ACC ACC AAC 1296 
Val Val Arg Thr Thr Asn Ser He Gin His Pro Val Lys Thr Thr Asn 
420 425 430 

TAC GGT TTC TCG CTG TCC GAC CAA ATC CAA TGG AAC GAC GTG TTC AGT 1344 
Tyr Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser 
435 440 445 

AGC CGC GCA GGT ATC CGT TAC GAC CAC ACC AAA ATG ACG CCT CAG GAA 13 92 

Ser Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu 
450 455 460 

TTG AAT GCC GAC TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC AAC 1440 
Leu Asn Ala Asp Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn 
465 470 475 480 

ACT TAT AAA GGC TGG AGC GGA TTT GTC GGC TTG GCG GCG CAG CTG AGC 1488 
Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Ser 
485 490 495 

CAA ACA TGG CGT TTG GGT TAC GAT GTG ACC TCA GGT TTC CGC GTG CCG 1536 
Gin Thr Trp Arg Leu Gly Tyr Asp Val Thr Ser Gly Phe Arg Val Pro 
500 505 510 

AAT GCG TCT GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGC ACT TGG 1584 
Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Thr Trp 
515 520 525 
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Ficrure 7 (cont'd.) 

AAG CCT AAT CCT AAT TTG AAG GCA GAA CGC AGC ACC ACC CAC ACC CTG 1632 
Lys Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu 
530 535 540 

TCC TTG CAG GGG CGC GGC GAC AAA GGG ACA CTG GAT GCC AAC CTG TAT 168 0 

Ser Leu Gin Gly Arg Gly Asp Lys Gly Thr Leu Asp Ala Asn Leu Tyr 
545 550 555 560 

CAA AGC AAT TAC CGA AAC TTC CTG TCG GAA GAG CAG AAT CTG ACT GTC 1728 
Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Asn Leu Thr Val 
565 570 575 

AGC GGC ACA CCC GGC TGT ACT GAG GAG GAT GCT TAC TAC TAT AGA TGC 17 76 

Ser Gly Thr Pro Gly Cys Thr Glu Glu Asp Ala Tyr Tyr Tyr Arg Cys 
580 585 590 

AGC GAC CCC TAC AAA GAA AAA CTG GAT TGG CAG ATG AAA AAT ATC GAC 1824 
Ser Asp Pro Tyr Lys Glu Lys Leu Asp Trp Gin Met Lys Asn He Asp 
595 600 605 

AAG GCC AGA ATC CGC GGT ATC GAG TTG ACA GGC CGT CTG AAT GTG GAC 18 72 

Lys Ala Arg He Arg Gly He Glu Leu Thr Gly Arg Leu Asn Val Asp 
610 615 620 

AAA GTA GCG TCT TTT GTT CCT GAG GGT TGG AAA CTG TTC GGC TCG CTG 192 0 

Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu 
625 630 635 640 

GGT TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA 1968 
Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr 
645 650 655 

CAG CCG CTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC GAA 2016 
Gin Pro Leu Lys Val He Ala Gly He Asp Tyr Glu Ser Pro Ser Glu 
660 665 670 

AAA TGG GGC GTA TTC TCC CGC CTG ACC TAT CTA GGC GCG AAA AAG GTC 2064 
Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Val 
675 680 685 

AAA GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG CCT 2112 
Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr Pro 
690 695 700 

TTG CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT 2160 
Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr 
705 710 715 720 

GTG TTT GAT ATG TAC GGC TTC TAC AAA CCG GCT AAA AAC CTG ACT TTG 2208 
Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Ala Lys Asn Leu Thr Leu 
725 730 735 

CGT GCA GGC GTG TAC AAC CTG TTC AAC CGC AAA TAC ACC ACT TGG GAT 22 56 

Arg Ala Gly Val Tyr Asn Leu Phe Asn Arg Lys Tyr Thr Thr Trp Asp 
740 745 750 

TCC CTG CGC GGT TTA TAT AGC TAC AGC ACC ACC AAT GCG GTC GAC CGC 23 04 

Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg 
755 760 765 

GAT GGC AAA GGC TTA GAC CGC TAC CGC GCC CCA GGC CGC AAT TAC GCC 2352 
Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Gly Arg Asn Tyr Ala 
770 775 780 
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Figure 7 (cont'd.) 

GTA TCG CTG GAA TGG AAG TTT TAA 
Val Ser Leu Glu Trp Lys Phe * 
785 790 
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ATG AAA CCA TTA CAA ATG CTC CCT ATC GCC GCG CTG GTC GGC AGT ATT 4 8 

Met Lys Pro Leu Gin Met Leu Pro He Ala Ala Leu Val Gly Ser He 
15 10 15 

TTC GGC AAT CCG GTC TTT GCG GCA GAT GAA GCT GCA ACT GAA ACC ACA 96 
Phe Gly Asn Pro Val Phe Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAG GCA GAG GTA AAA GCA GTG CGC GTT AAA GGC CAG CGC AAT 144 
Pro Val Lys Ala Glu Val Lys Ala Val Arg Val Lys Gly Gin Arg Asn 
35 40 45 

GCG CCT GCG GCT GTG GAA CGC GTC AAC CTT AAC CGT ATC AAA CAA GAA 192 
Ala Pro Ala Ala Val Glu Arg Val Asn Leu Asn Arg He Lys Gin Glu 
50 55 60 

ATG ATA CGC GAC AAC AAA GAC TTG GTG CGC TAT TCC ACC GAT GTC GGC 24 0 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAC AGC GGC CGC CAT CAA AAA GGC TTT GCT GTT CGC GGC GTG 28 8 

Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGC GTG AGC ATA GAC GGC GTA AAC CTG CCT GAT 3 36 

Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Asn Leu Pro Asp 
100 105 110 

TCC GAA GAA AAC TCG CTG TAC GCC CGT TAT GGC AAC TTC AAC AGC TCG 384 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGT CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAC ATC GTA AAA 432 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Asp He Val Lys 
130 135 140 

GGG GCG GAC TCT TTC AAT ACC GGC AGC GGC GCC TTG GGC GGC GGT GTG 480 
Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACC CTG CAA GGA CGT GAC TTA CTG TTG CCT GAA CGG CAG 528 
Asn Tyr Gin Thr Leu Gin Gly Arg Asp Leu Leu Leu Pro Glu Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC ACG CGT AAC CGT GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Thr Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACC CTC GGT TTC GGC GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGG CGC GGC CAT GAA ACT GAA AGC GCG GGC AAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Lys 
210 215 220 

CGT GGT TAT CCG GTA GAG GGT GCT GGT AGC GGA GCG AAT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala Asn He Arg Gly 
225 230 235 240 

TCT GCG CGC GGT ATT CCT GAT CCG TCC CAA CAC AAA TAC CAC AGC TTC 768 
Ser Ala Arg Gly He Pro Asp Pro Ser Gin His Lys Tyr His Ser Phe 
245 250 255 
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Figure 8 (eont. 'd) . 

TTG GOT AAG ATT GCT TAT CAA ATC AAC GAC AAC CAC CGC ATC GGC GCA 816 
Leu Gly Lys lie Ala Tyr Gin He Asn Asp Asn His Arg He Gly Ala 
260 265 270 

TCG CTC AAC GGT CAG CAG GGG CAT AAT TAC ACG GTT GAA GAG TCT TAC 864 
Ser Leu Asn Gly Gin Gin Gly His Asn Tyr Thr Val Glu Glu Ser Tyr 
275 280 285 

AAC CTG CTT GCT TCT TAT TGG CGT GAA GCT GAC GAT GTC AAC AGA CGG 912 
Asn Leu Leu Ala Ser Tyr Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGT AAC ACC AAC CTC TTT TAC GAA TGG ACG CCG GAA TCC GAC CGG TTG 960 
Arg Asn Thr Asn Leu Phe Tyr Glu Trp Thr Pro Glu Ser Asp Arg Leu 
305 310 315 320 

TCT ATG GTA AAA GCG GAT GTC GAT TAT CAA AAA ACC AAA GTA TCT GCG 1008 
Ser Met Val Lys Ala Asp Val Asp Tyr Gin Lys Thr Lys Val Ser Ala 
325 330 335 

GTC AAC TAC AAA GGT TCG TTC CCG ATA GAG GAT TCT TCC ACC TTG ACA 1056 
Val Asn Tyr Lys Gly Ser Phe Pro He Glu Asp Ser Ser Thr Leu Thr 
340 345 350 

CGT AAC TAC AAT CAA AAG GAC TTG GAT GAA ATC TAC AAC CGC AGT ATG 1104 
Arg Asn Tyr Asn Gin Lys Asp Leu Asp Glu He Tyr Asn Arg Ser Met 
355 360 365 

GAT ACC CGC TTC AAA CGC ATT ACC CTG CGT TTG GAC AGC CAT CCG TTG 1152 
Asp Thr Arg Phe Lys Arg He Thr Leu Arg Leu Asp Ser His Pro Leu 
370 375 380 

CAA CTC GGG GGG GGG CGA CAC CGC CTG TCG TTT AAA ACT TTC GCC AGC 12 00 

Gin Leu Gly Gly Gly Arg His Arg Leu Ser Phe Lys Thr Phe Ala Ser 
385 390 395 400 

CGC CGT GAT TTT GAA AAC CTA AAC CGC GAC GAT TAT TAC TTC AGC GGC 1248 
Arg Arg Asp Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Gly 
405 410 415 

CGT GTT GTT CGA ACC ACC AGC AGT ATC CAG CAT CCG GTG AAA ACC ACC 1296 
Arg Val Val Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr 
420 425 430 

AAC TAC GGT TTC TCA CTG TCT GAC CAA ATT CAA TGG AAC GAC GTG TTC 1344 
Asn Tyr Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe 
435 440 445 

AGT AGC CGC GCA GGT ATC CGT TAC GAT CAT ACC AAA ATG ACG CCT CAG 1392 
Ser Ser Arg Ala Gly He Arg Tyr Asp His Thr Lys Met Thr Pro Gin 
450 455 460 

GAA TTG AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC 1440 
Glu Leu Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala 
465 470 475 480 

AAC ACT TAT AAA GGC TGG AGC GGT TTT GTC GGC TTG GCG GCG CAA CTG 1488 
Asn Thr Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu 
485 490 495 

AAT CAG GCT TGG CGT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC 1536 
Asn Gin Ala Trp Arg Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val 
500 505 510 
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Fiemre 8 (cont'd.) 

CCC AAT GCG TCC GAA GTG TAT TTC ACT TAC AAC CAC GGT TCG GGT AAT 1584 
Pro Asn Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn 
515 520 525 

TGG CTG CCC AAT CCC AAC CTG AAA GCC GAG CGC ACG ACC ACC CAC ACC 16 32 

Trp Leu Pro Asn Pro Asn Leu Lys Ala Glu Arg Thr Thr Thr His Thr 
530 535 540 

CTC TCT CTG CAA GGC CGC AGC GAA AAA GGT ACT TTG GAT GCC AAC CTG 1680 
Leu Ser Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu 
545 550 555 560 

TAT CAA AGC AAT TAC CGC AAT TTC CTG TCT GAA GAG CAG AAG CTG ACC 172 8 

Tyr Gin Ser Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr 
565 570 575 

ACC AGC GGC GAT GTC AGC TGT ACT CAG ATG AAT TAC TAC TAC GGT ATG 1776 
Thr Ser Gly Asp Val Ser Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met 
580 585 590 

TGT AGC AAT CCT TAT TCC GAA AAA CTG GAA TGG CAG ATG CAA AAT ATC 1824 
Cys Ser Asn Pro Tyr Ser Glu Lys Leu Glu Trp Gin Met Gin Asn lie 
595 600 605 

GAC AAG GCC AGA ATC CGC GGT ATC GAG CTG ACG GGC CGT CTG AAT GTG 1872 
Asp Lys Ala Arg lie Arg Gly lie Glu Leu Thr Gly Arg Leu Asn Val 
610 615 620 

GAC AAA GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA CTG TTC GGC TCG 192 0 

Asp Lys Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser 
625 630 635 640 

CTG GGT TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC 196 8 

Leu Gly Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser 
645 650 655 

ACC CAG CCG TTG AAA GTG ATT GCC GGT ATC GAC TAT GAA AGT CCG AGC 2016 
Thr Gin Pro Leu Lys Val lie Ala Gly lie Asp Tyr Glu Ser Pro Ser 
660 665 670 

GAA AAA TGG GGC GTG TTC TCC CGC CTG ACC TAT CTG GGC GCG AAA AAG 2064 
Glu Lys Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys 
675 680 685 

GTC AAA GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC TGG GGT ACG 2112 
Val Lys Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Trp Gly Thr 
690 695 700 

CCT TTG CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT 2160 
Pro Leu Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala 
705 710 715 720 

TAT GTG TTC GAT ATG TAC GGC TTC TAC AAA CCG GTG AAA AAC CTG ACT 2208 
Tyr Val Phe Asp Met Tyr Gly Phe Tyr Lys Pro Val Lys Asn Leu Thr 
725 730 735 

TTG CGT GCA GGC GTA TAT AAT GTG TTC AAC CGC AAA TAC ACC ACT TGG 2256 
Leu Arg Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp 
740 745 750 

GAT TCC CTG CGC GGC CTG TAT AGC TAC AGC ACC ACC AAC TCG GTC GAC 2304 
Asp Ser Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ser Val Asp 
755 760 765 
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Figure 8 (cont'd. ) 

CGC GAT GGC AAA GGC TTA GAC CGC TAC CGC GCC CCA AGC CGT AAT TAC 
Arg Asp Gly Lys Gly Leu Asp Arg Tyr Arg Ala Pro Ser Arg Asn Tvr 
770 775 780 

GCC GTA TCG CTG GAA TGG AAG TTT TAA 
Ala Val Ser Leu Glu Trp Lys Phe * 
785 790 
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Figure 9 

ATG AAA CCA TTA CAC ATG CTT CCT ATT GCC GCG CTG GTC GGC AGT ATT 4 8 

Met Lys Pro Leu His Met Leu Pro He Ala Ala Leu Val Gly Ser He 
15 10 15 

TTC GGC AAT CCG GTC TTG GCA GCG GAT GAA GCT GCA ACC GAA ACC ACA 96 
Phe Gly Asn Pro Val Leu Ala Ala Asp Glu Ala Ala Thr Glu Thr Thr 
20 25 30 

CCC GTT AAA GCA GAG ATA AAA GAA GTG CGC GTT AAA GAC CAG CTT AAT 144 
Pro Val Lys Ala Glu He Lys Glu Val Arg Val Lys Asp Gin Leu Asn 
35 40 45 

GCG CCT GCA ACC GTG GAA CGT GTC AAC CTC GGC CGC ATT CAA CAG GAA 192 
Ala Pro Ala Thr Val Glu Arg Val Asn Leu Gly Arg He Gin Gin Glu 
50 55 60 

ATG ATA CGC GAC AAC AAA GAC TTG GTG CGT TAC TCC ACC GAC GTC GGC 24 0 

Met He Arg Asp Asn Lys Asp Leu Val Arg Tyr Ser Thr Asp Val Gly 
65 70 75 80 

TTG AGC GAT AGC GGC CGC CAT CAA AAA GGC TTT GCT GTG CGC GGC GTG 288 
Leu Ser Asp Ser Gly Arg His Gin Lys Gly Phe Ala Val Arg Gly Val 
85 90 95 

GAA GGC AAC CGT GTC GGT GTC AGC ATT GAC GGC GTG AGC CTG CCT GAT 3 36 

Glu Gly Asn Arg Val Gly Val Ser He Asp Gly Val Ser Leu Pro Asp 
100 105 110 

TCG GAA GAA AAC TCA CTG TAT GCA CGT TAT GGC AAC TTC AAC AGC TCG 384 
Ser Glu Glu Asn Ser Leu Tyr Ala Arg Tyr Gly Asn Phe Asn Ser Ser 
115 120 125 

CGC CTG TCT ATC GAC CCC GAA CTC GTG CGC AAC ATC GAA ATC GCG AAG 432 
Arg Leu Ser He Asp Pro Glu Leu Val Arg Asn He Glu He Ala Lys 
130 135 140 

GGC GCT GAC TCT TTC AAT ACC GGT AGC GGC GCA TTG GGT GGC GGC GTG 4 80 

Gly Ala Asp Ser Phe Asn Thr Gly Ser Gly Ala Leu Gly Gly Gly Val 
145 150 155 160 

AAT TAC CAA ACC CTG CAA GGA CAT GAT TTG CTG TTG GAC GAC AGG CAA 52 8 

Asn Tyr Gin Thr Leu Gin Gly His Asp Leu Leu Leu Asp Asp Arg Gin 
165 170 175 

TTC GGC GTG ATG ATG AAA AAC GGT TAC AGC AGC CGC AAC CGC GAA TGG 576 
Phe Gly Val Met Met Lys Asn Gly Tyr Ser Ser Arg Asn Arg Glu Trp 
180 185 190 

ACA AAT ACA CTC GGT TTC GGT GTG AGC AAC GAC CGC GTG GAT GCC GCT 624 
Thr Asn Thr Leu Gly Phe Gly Val Ser Asn Asp Arg Val Asp Ala Ala 
195 200 205 

TTG CTG TAT TCG CAA CGT CGC GGT CAT GAG ACC GAA AGC GCG GGC GAG 672 
Leu Leu Tyr Ser Gin Arg Arg Gly His Glu Thr Glu Ser Ala Gly Glu 
210 215 220 

CGT GGC TAT CCG GTA GAG GGT GCT GGC AGC GGA GCA ATT ATC CGT GGT 720 
Arg Gly Tyr Pro Val Glu Gly Ala Gly Ser Gly Ala He He Arg Gly 
225 230 235 240 

TCG TCA CGC GGT ATC CCT GAT CCG TCC AAA CAC AAA TAC CAC AAC TTC 768 
Ser Ser Arg Gly He Pro Asp Pro Ser Lys His Lys Tyr His Asn Phe 
245 250 255 
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FicTurw 9 (cont'd.) 

TTG GGT AAG ATT GCT TAT CAA ATC AAC GAC AAG CAC CGC ATC GGC CCA 816 
Leu Gly Lys lie Ala Tyr Gin He Asn Asp Lys His Arg He Gly Pro 
260 265 270 

TCG TTT AAC GGC CAG CAG GGG CAT AAT TAC ACG ATT GAA GAG TCT TAT 864 
Ser Phe Asn Gly Gin Gin Gly His Asn Tyr Thr He Glu Glu Ser Tyr 
275 280 285 

AAC CTG ACC GCT TCT TCC TGG CGC GAA GCC GAT GAC GTA AAC AGA CGG 912 
Asn Leu Thr Ala Ser Ser Trp Arg Glu Ala Asp Asp Val Asn Arg Arg 
290 295 300 

CGC AAT GCC AAC CTC TTT TAC GAA TGG ACG CCT GAT TCA AAT TGG CTG 96 0 

Arg Asn Ala Asn Leu Phe Tyr Glu Trp Thr Pro Asp Ser Asn Trp Leu 
305 310 315 320 

TCG TCT TTG AAG GCG GAC TTC GAT TAT CAG ACA ACC AAA GTG GCG GCG 1008 
Ser Ser Leu Lys Ala Asp Phe Asp Tyr Gin Thr Thr Lys Val Ala Ala 
325 330 335 

GTT AAC AAC AAA GGC TCG TTC CCG ACG GAT TAT TCC ACC TGG ACG CGC 1056 
Val Asn Asn Lys Gly Ser Phe Pro Thr Asp Tyr Ser Thr Trp Thr Arg 
340 345 350 

AAC TAT AAT CAG AAG GAT TTG GAG AAT ATA TAC AAC CGC AGC ATG GAC 1104 
Asn Tyr Asn Gin Lys Asp Leu Glu Asn He Tyr Asn Arg Ser Met Asp 
355 360 365 

ACC CGA TTC AAA CGT TTT ACT TTG CGT ATG GAC AGC CAA CCG TTG CAA 1152 
Thr Arg Phe Lys Arg Phe Thr Leu Arg Met Asp Ser Gin Pro Leu Gin 
370 375 380 

CTG GGC GGC CAA CAT CGC TTG TCG CTT AAA ACT TTC GCC AGT CGG CGT 1200 
Leu Gly Gly Gin His Arg Leu Ser Leu Lys Thr Phe Ala Ser Arg Arg 
385 390 395 400 

GAG TTT GAA AAC TTA AAC CGC GAC GAT TAT TAC TTC AGC GAA AGA GTA 1248 
Glu Phe Glu Asn Leu Asn Arg Asp Asp Tyr Tyr Phe Ser Glu Arg Val 
405 410 415 

TCC CGT ACT ACC AGC TCG ATT CAA CAC CCC GTG AAA ACC ACT AAT TAT 1296 
Ser Arg Thr Thr Ser Ser He Gin His Pro Val Lys Thr Thr Asn Tyr 
420 425 430 

GGT TTC TCA CTG TCT GAT CAA ATC CAA TGG AAC GAC GTG TTC AGC AGC 1344 
Gly Phe Ser Leu Ser Asp Gin He Gin Trp Asn Asp Val Phe Ser Ser 
435 440 445 

CGT OCA GAT ATC CGT TAC GAT CAT ACC AAA ATG ACG CCT CAG GAA TTG 1392 
Arg Ala Asp He Arg Tyr Asp His Thr Lys Met Thr Pro Gin Glu Leu 
450 455 460 

AAT GCC GAG TGT CAT GCT TGT GAC AAA ACA CCG CCT GCA GCC AAT ACT 1440 
Asn Ala Glu Cys His Ala Cys Asp Lys Thr Pro Pro Ala Ala Asn Thr 
465 470 475 480 

TAT AAA GGC TGG AGC GGA TTT GTC GGT TTG GCG GCG CAA CTG AAT CAG 1488 
Tyr Lys Gly Trp Ser Gly Phe Val Gly Leu Ala Ala Gin Leu Asn Gin 
485 490 495 

GCT TGG CAT GTC GGT TAC GAC ATT ACT TCC GGC TAC CGT GTC CCC AAT 1536 
Ala Trp His Val Gly Tyr Asp He Thr Ser Gly Tyr Arg Val Pro Asn 
500 505 510 
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Figure 9 (cont'd.) 

GCG TCC GAA GTG TAT TTC ACT TAG AAC CAC GGT TCG GGT AAT TGG CTG 
Ala Ser Glu Val Tyr Phe Thr Tyr Asn His Gly Ser Gly Asn Trp Leu 
515 520 525 

CCC AAT CCC AAC CTG AAA GCC GAG CGC AGC ACC ACC CAC ACC CTG TCT 
Pro Asn Pro Asn Leu Lys Ala Glu Arg Ser Thr Thr His Thr Leu Ser 
530 535 540 

CTG CAA GGC CGC AGC GAA AAA GGT ACT TTG GAT GCC AAC CTG TAT CAA 
Leu Gin Gly Arg Ser Glu Lys Gly Thr Leu Asp Ala Asn Leu Tyr Gin 
545 550 555 560 

AAC AAT TAC CGC AAC TTC TTG TCT GAA GAG CAG AAG CTG ACC ACC AGC 
Asn Asn Tyr Arg Asn Phe Leu Ser Glu Glu Gin Lys Leu Thr Thr Ser 
565 570 575 

GGC GAT GTC GGC TGT ACT CAG ATG AAT TAC TAC TAC GGT ATG TGT AGC 
Gly Asp Val Gly Cys Thr Gin Met Asn Tyr Tyr Tyr Gly Met Cys Ser 
580 585 590 

AAT CCT TAT TCC GAA AAA CCG GAA TGG CAG ATG CAA AAT ATC GAT AAG 
Asn Pro Tyr Ser Glu Lys Pro Glu Trp Gin Met Gin Asn He Asp Lys 
595 600 605 

GCC CGA ATC CGT GGT CTT GAG CTG ACA GGC CGT CTG AAT GTG ACA AAA 
Ala Arg He Arg Gly Leu Glu Leu Thr Gly Arg Leu Asn Val Thr Lys 
610 615 620 

GTA GCG TCT TTT GTT CCT GAG GGC TGG AAA TTG TTC GGC TCG CTG GGT 
Val Ala Ser Phe Val Pro Glu Gly Trp Lys Leu Phe Gly Ser Leu Gly 
^25 630 635 640 

TAT GCG AAA AGC AAA CTG TCG GGC GAC AAC AGC CTG CTG TCC ACA CAG 
Tyr Ala Lys Ser Lys Leu Ser Gly Asp Asn Ser Leu Leu Ser Thr Gin 
645 650 655 

CCG CCG AAA GTG ATT GCC GGT GTC GAC TAC GAA AGC CCG AGC GAA AAA 
Pro Pro Lys Val He Ala Gly Val Asp Tyr Glu Ser Pro Ser Glu Lys 
660 665 670 

TGG GGT GTG TTC TCC CGC CTG ACT TAT CTG GGT GCG AAA AAG GCC AAA 
Trp Gly Val Phe Ser Arg Leu Thr Tyr Leu Gly Ala Lys Lys Ala Lys 
675 680 685 

GAC GCG CAA TAC ACC GTT TAT GAA AAC AAG GGC CGG GGT ACG CCT TTG 
Asp Ala Gin Tyr Thr Val Tyr Glu Asn Lys Gly Arg Gly Thr Pro Leu 
690 695 700 

CAG AAA AAG GTA AAA GAT TAC CCG TGG CTG AAC AAG TCG GCT TAT GTG 
Gin Lys Lys Val Lys Asp Tyr Pro Trp Leu Asn Lys Ser Ala Tyr Val 
705 710 715 720 

TTT GAT ATG TAC GGC TTC TAC AAA CTG GCT AAA AAC CTG ACT TTG CGT 
Phe Asp Met Tyr Gly Phe Tyr Lys Leu Ala Lys Asn Leu Thr Leu Arg 
725 730 735 

GCA GGC GTA TAT AAT GTG TTC AAC CGC AAA TAC ACC ACT TGG GAT TCC 
Ala Gly Val Tyr Asn Val Phe Asn Arg Lys Tyr Thr Thr Trp Asp Ser 
740 745 750 

CTG CGC GGT TTG TAT AGC TAC AGC ACC ACC AAC GCG GTC GAC CGA GAT 
Leu Arg Gly Leu Tyr Ser Tyr Ser Thr Thr Asn Ala Val Asp Arg Asp 
755 760 765 
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Piqure 9 (cont'd.) 

GGC AAA GGC TTA GAC CGC TAC CGC GCC TCA GGC CGT AAT TAC GCC GTA 
Gly Lys Gly Leu Asp Arg Tyr Arg Ala Ser Gly Arg Asn Tyr Ala Val 
770 775 780 

TCG CTG GAT TGG AAG TTT TGA ATTCC 
Ser Leu Asp Trp Lys Phe * 
785 790 
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HMBRA MKPLQhILPIAALVGSIFGNPVIJUUDEAATETTPVKAEIKAVRVKGQRKAP 50 

HMBRB MKPLQMPPIAALLGSIFGNPVFAXDEAATETTPVKAEVKAVRVKGQRNAP 50 

HMBRC MKPLQMLPIAALVGSIFGNPVFAADEAATETTPVKAEVKAVRVKGQRNAP 50 

HMBRMSll MKPLHMLPIAALVGSIFGNPVLAADEAATETTPVKAEIKEVRVKDQLNAP 50 



HMBRA AAVERVNLNRIKQEMIRDNKDLVRYSTDVGLSDSGRHQKGFAVRGVEGNR 100 

HMBRB AAVERVNLNRIKQEMIRDNKDLVRYSTDVGLSDRSRHQKGFAIRGVEGDR 100 

HMBRC AAVERVNLNRIKQEMIRDNKDLVRYSTDVGLSDSGRHQKGFAVRGVEGNR 100 

HMBRMSll ATVERVNLGRIQQEMIRDNKDLVRYSTDVGLSDSGRHQKGFAVRGVEGNR 100 



HMBRA VGVSIDGVNLPDSEENSLYARYGNFNSSRLSIDPELVRNIEIVKGADSFN 150 

HMBRB VGVSIDGVNLPDSEENSLYARYGNFNSSRLSIDPELVRNIDIVKGADSFN 150 

HMBRC VGVSIDGVNLPDSEENSLYARYGNFNSSRLSIDPELVRNIDIVKGADSFN 150 

HMBRMSll VGVSIDGVSLPDSEENSLYARYGNFNSSRLSIDPELVRNIEIAKGADSFN 150 



HMBRA TGSGALGGGVNYQTLQGRDLLLDDRQFGVMMKNGYSTRNREWTNTLGPGV 200 

HMBRB TGSGALGGGVNYQTLQGRDLLLPERQFGVMMKNGYSTRNREWTNTLGFGV 200 

HMBRC TGSGALGGGVNYQTLQGRDLLLPERQFGVMMKNGYSTRNREWTNTLGFGV 200 

HMBRMSll TGSGALGGGVNYQTLQGHDLLLDDRQFGVMMKNGYSSRNREWTNTLGFGV 200 



HMBRA SNDRVDAALLYSQRRGHETESAGNRGYPVEGAGKETNIRGSARGIPOPSK 250 

HMBRB SNDRVDAALLYSQRRGHETESAGKRGYPVEGAGSGANIRQSARGIFDPSQ 250 

HMBRC SNDRVDAALLYSQRRGHETESAGKRGYPVEGAGSGANIRGSARGIPDPSQ 250 

HMBRMSll SNDRVDAALLYSQRRGHETESAGERGYPVEGAGSGAIIRGSSRGIPDPSK 250 



HMBRA HKYHNFLGKIAYQINDNHRIGASLNGQQGHNYTVEESYNLTASSWREADD 300 

HMBRB HKYHSFLGKIAYQINDNHRIGASLNGQQGHMYTVEESYNLLASYWREADD 300 

HMBRC HKYHSFLGKIAYQINDNHRIGASLNGQQGHNYTVEESYNLLASYWREADD 300 

HMBRMSll HKYHNFLGKIAYQINDKHRIGPSFNGQQGHNYTIEESYNLTASSWREADD 300 



HMBRA VNRRRNANLFYEWMPDShlWLSSLKADFDYQKTKVAAIN-KGSPPT-NYTT 348 

HMBRB VinWRNTNLFYEWTPESDRLSMVKADVDYQKTlWSAVNYKGSPPT-NYTT 349 

HMBRC VNRRRNTNLFYEWTPESDRLSMVKADVDYQKTKVSAVNYKGSFPIEDSST 350 

HMBRMSll VNRRRNANLFYEWTPDSNWLSSLKADFDYQTTKVAAVNNKGSPPTD-YST 349 



HMBRA WETEYHKKEVGEIYNRSMDTRFlOirrLRIJlSHPLQLGGGIUIRLSPKTPAS 39S 

HMBRB V^EYHKKEVGEIYNRSMDTTFKRITLRKDSHPLQLGGGRHRLSPKTFAG 399 

HMBRC LTRNYNQKDLDEIYNRSMDTRFKRITLRLDSHPLQLGGGRHRLSFKTPAS 400 

HMBRMSll vmiNYNQKDLENIYITOSMimU'lWFTUlMDSQPLQLGG-QHRLSLKTFAS 3 98 



RRDFENUTODDYYFSGRVVRTTSSIQHPVKTTNyGFSLSDQIQWOTVFSS 448 

QRBFENIJIRODYYFSGRVVRTTNSIQHPVKTTNyGPSLSDQZaNNDVPSS 449 

RRDFENUIRDDYYFSGRVVRTTSSIQHPVKTTNyGPSLSDQIQWNDVFSS 450 

RREFENUTODDYYFSERVSRTTSSIQHPVKTTNyGPSLSDQlQWNDVFSS 448 



HMBRA RAGIRYDHTKMTPQELNAECHACDKTPPAANTYKGWSOFVOLAAQUIOMI 498 

HMBRB RAGIRYDHTKMTPQELNADCHACDKTPPAANTYKGWSGFVGLAAOLSQm 499 

HMBRC RAGIRYDHTKMTPQEUUECHACDKTPPAANTYKGWSGPVGLAAQLNQMf 500 
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HMBRMSi: ?^Z: = "OH?raTPQELNAECHACDKTPPAAKTYKGWSGP;GLAACLNOAW 49c 



HMBRA RVGYDITSGYRVPNASEVYFTYNHGSGNWLPNPNLKAERSTTHTLSLQGR 548 

HMBRB RLGYDVTSGFRVPNASEVYFTYNHGSGTWKPNPNLKAERSTTHTLSLQGR 549 

HMBRC RVGYDITSGYRVPNASEVYFTYNHGSGNWLPNPNLKAERTTTHTLSLQGR 550 

HMBRMSll HVGYDITSGYRVPNASEVYFTYNHGSGNWLPNPNLKAERSTTHTLSLQGR 548 

HMBRA SEKGMLDANLYQSNYRNFLSEEQKLTTSGTPGCTEENAYYSICSDPYKEK 598 

HMBRB GDKGTLDANLYQSNYRNFLSEEQNLTVSGTPGCTEEDAYYYRCSDPYKEK 599 

HMBRC SEKGTLDANLYQSNYRNFLSEEQKLTTSGDVSCTQMNYYYGMCSNPYSEK 600 

HMBRMSi: SEKGTLDANLYQNNYRNFLSEEQKLTTSGDVGCTQMNYYYGMCSNPYSEK 598 



HMBRA -DWOMKNIDKARIRGIELTGRLNVDKVASFVPEGWKLFGSLGYAKSKLSG 648 

HMBRB LDWQMKNIDKARIRGIELTGRLNVDKVASFVPEGWKLFGSLGYAKSKLSG 649 

HMBRC LEWQMQNIDKARIRGIELTGRLNVDKVASFVPEGWKLFGSLGYAKSKLSG 650 

HMBRMSi: PEWOMQNIDKARIRGLELTGRLNVTKVASFVPEGWKLFGSLGYAKSKLSG 648 



HMBRA 
HMBRB 
HMBRC 
HMBRMSll 



DNSLLSTQPLKVIAGIDYESPSEKWGVFSRLTYLGAKKVKDAQYTVYENK 
DNSLLSTQPLKVIAGIDYESPSEKWGVFSRLTYLGAKKVKDAQYTVYENK 
DNSLLSTQPLKVIAGIDYESPSEKWGVFSRLTYLGAKKVKDAQYTVYENK 
DNSLLSTQPPKVIAGVDYESPSEKWGVFSRLTYLGAKKAKDAQYTVYENK 



698 
699 

700 
698 



HMBRA GWGTPLOKKVKDYPWU^KSAYVFDMYGFYKPVKNLTLRAGVYNLFNRKYT 748 

HMBRB GWGTPLOKKVKrYPWLNKSAYVFDMYGFYKPAKNLTLRAGVYNLFNRKYT 749 

HMBRC GWGTPLQKKVKDYPWLNKSAYVFDMYGFYKPVKNLTLRAGVYNVFNRKYT 750 

HMBRMSll GRGTPLQKKVKDYPWUJKSAYWDMYGFYKIJUOILTLRAGVYNVFNRKYT 748 

HMBRA TWDSLRGLYSYSTTNAVDRDGKGLDRYRAPGRNYAVSLEWKF 790 

HMBRB TWDSLRGLYSYSTTNAVDRDGKGLDRYRAPGRNYAVSLEWKF 791 

HMBRC TWDSLRGLYSYSTTNSVDRDGKGLDRYRAPSRNYAVSLEWKF 792 

HMBRMSll TWDSLRGLYSYSTTNAVDRDGKGLDRYRASGRNYAVSLDWKF 790 



Ident icy- 
Similarity: 



671 ( 84.7%) 
92 ( 11.6%) 
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